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| EDITORIAL 


Alarm Bells Should Help Us Refocus 


WE'RE HEARING ALARM BELLS THESE DAYS ABOUT SCIENCE IN THE UNITED STATES. ON THE 
one hand, we’ve been told that in the global economy of today’s “flattened” world, we need to bolster 
innovation and competitiveness and science and engineering research and education. Earlier this year, 
when President Bush announced his American Competitiveness Initiative, the future appeared 
brighter for the physical sciences, math, and engineering (although the National Institutes of Health 
budget remains flat). But other alarms have sounded that the increases may be at the expense of the 
disciplines that have historically sought to understand how all this hard work actually helps societies 
deal with these very issues. Last month, Senator Kay Bailey Hutchinson (R-TX), chair of a Senate 
panel that oversees the U.S. National Science Foundation (NSF), aggressively argued that the agency 
should limit its funding for the social sciences and focus on the “hard” sciences. Although the 
committee stopped short of tying NSF’s hands, Congress has yet to make a final decision on whether 
or not competitiveness is just about technology. Congress should think hard about this. 

In the past, investments in science have brought breakthrough 
technologies, a productive technical workforce and positive trade 
balance in the high-tech sector, and medical miracles, along with 
many other tangible benefits. Most Americans believe they are 
healthier and better off because of the nation’s long-standing 
preeminence in science and technology. Moreover, because 
other nations are replicating our blueprint for research and 
higher education with increasing success, competition is 
growing fierce. So fierce, that our country’s present and future 
position in the world economy is at considerable risk. 

All this challenges our political leaders, but it should also 
challenge the broad scientific community to make sure that our 
science actually helps provide what most Americans need. 
Clearly, this requires an aggressive and ambitious program of 
basic research in the hard sciences, including physics, chemistry, materials science, mathematics 
and computer science, biology and biomedical science, earth and space sciences, and engineering. 
But that will not be enough. 

Over decades, as our scientific knowledge has become more sophisticated, we have come to 
recognize how such things as human dynamics and institutional behavior can either enhance or 
impede the benefits to society of our research achievements. But recognizing that reality is only the 
first step. We need a much better understanding of how new technical knowledge and tools translate 
into products, jobs, and wealth; how people learn; how offshoring of jobs, even technical jobs, 
affects our workforce and quality of life; how increased investment in science and engineering 
research leads to increased industrial productivity and to better jobs; and how to cope with a host 
of ever-changing societal problems. These issues are the domain of the social sciences, which also 
need increased federal support. But that still is not sufficient. 

The successful application of new knowledge and breakthrough technologies, which are likely to 
occur with ever-increasing frequency, will require an entirely new interdisciplinary approach to 
policy-making: one that operates in an agile problem-solving environment and works effectively at 
the interface where science and technology meet business and public policy. It must be rooted in a 
vastly improved understanding of people, organizations, cultures, and nations and be implemented by 
innovative strategies and new methods of communication. All of this can occur only by engaging the 
nation’s top social scientists, including policy experts, to work in collaboration with scientists and 
engineers from many fields and diverse institutions on multidisciplinary research efforts that address 
large but well-defined national and global problems. This will not be easy. It will require qualitative 
changes in research cultures and in the way federal agencies consider research funding. 

Cynics may dismiss these concerns with an abrupt, “We’ve seen all this before.” I believe they 
are wrong and it would be folly to ignore the alarm bells. Rather, let us use these sometimes shrill 
warnings to help us refocus and regain the high road for the 21st century for science, the nation, and 
all of humanity. Albert Einstein eloquently framed this issue for scientists in 1931 at the California 
Institute of Technology: “Concern for man himself and his fate must always constitute the chief 
objective of all technological endeavors . . . Never forget this in the midst of your diagrams and 
equations.” Congress, as well as scientists, should remember these words. 

— Neal Lane 


10.1126/science.1131478 


www.sciencemag.org SCIENCE VOL312 30JUNE 2006 
Published by AAAS 


1847 


1854 


CLIMATE CHANGE 


Empathetic 
mice? 


Yes, It's Been Getting Warmer in 
Here Since the CO, Began to Rise 


WASHINGTON, D.C.—The last decades of the 
20th century were most likely warmer than any 
comparable period in the past 1000 years, a 
National Research Council (NRC) panel 
announced* at a press briefing here last week. 
The expert committee thus confirms the out- 
lines of the near-iconic “hockey stick” tempera- 
ture curve—a long cooling followed by a sharp 
warming during the past millennitum—that had 
become a favorite target of greenhouse contrari- 


hockey stick), then rose sharply into the 20th 
century (the blade) until it topped the relative 
warmth of 800 to 1000 years ago. That turn- 
around suggested that humans played a hand in 
the recent warming. 

After the hockey stick appeared prominently 
in a 2001 international climate assessment, the 
critics rushed in. Skeptics said Mann and col- 
leagues had erred badly in their statistical analy- 
sis, and some hinted at deliberate distortion. 


Surface Temperature Reconstructions for the Last 2000 Years 
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Warped sticks. The latest millennial temperature records (produced since the “hockey 
stick” came out using proxies such as tree rings) may have more squiggles, but they 
support a recent sharp warming to record high temperatures. 


ans. But the committee also says the evidence in 
parts of the stick is fuzzier than the public and 
many scientists might have thought. 

The hockey stick arose from work published 
in 1998 and 1999 by statistical climatologist 
Michael Mann of Pennsylvania State University 
in State College and two colleagues. They com- 
piled 12 Northern Hemisphere temperature 
records spanning the past millennium, using cli- 
mate proxies such as the width of tree rings and 
the chemical composition of corals. The result- 
ing temperature curve sloped gently downward 
for most of the millennium (the handle of the 


* Surface Temperature Reconstructions for the Last 
2,000 Years, National Research Council, available at 
fermat.nap.edu/catalog/11676.html 


The NRC commit- 
tee, chaired by meteor- 
ologist Gerald North of 
Texas A&M University 
in College Station, gener- 
ally supported Mann’s 
work. “We do roughly agree 
with the substance of their finding,” 
said North. Mann’s group sometimes erred, the 
committee found. “Some of their choices could 
have been made better,” said statistician and 
committee member Peter Bloomfield of North 
Carolina State University, Raleigh, “but it was 
quite plausible at the time.” In any case, the mis- 
steps “didn’t have a material effect on the final 
conclusion,” he said. And similar studies have 
followed from a half-dozen other groups, all 


Plastic 
from fruit 


giving the warm-cool—-much warmer pattern. 

In addition, none of the three committee 
members at the press briefing—North, 
Bloomfield, and paleoclimatologist Kurt 
Cuffey of the University of California, Berke- 
ley—had found any hint of scientific impro- 
priety. “I certainly did not see anything inap- 
propriate,” said North. “Maybe things could 
have been done better, but after all, it was the 
first analysis of its kind.” 

Although the committee generally sup- 
ported the work Mann led, “there’s a disagree- 
ment about how sure we are” about some of the 
study’s conclusions, said North. The committee 
has “high confidence” that the late 20th century 
was the warmest period of the past 400 years— 
a time when high-precision proxy records are 
abundant. That’s consistent with the idea that 
recent warming was in large part human- 
induced, Cuffey noted. But the committee has 
“less confidence” in Mann’s conclusion that 
recent temperatures have set a record for the 
entire millennium. “The committee concluded 
that Mann and his colleagues underestimated 
the uncertainty” in the earlier part of the record, 
said Cuffey, for which records are of lower 
quality and fewer in number. “In fact, these 
uncertainties aren’t fully quantified,” he said. 

When pressed, statistician Bloomfield 
characterized the committee’s lesser confi- 
dence in the millennial result as “more at the 
level of 2:1 odds” that Earth is now warmer 
than it has been in at least 1000 years. The 
committee has “even less confidence” in 
Mann ef al.’s 1999 conclusion that “the 1990s 
are likely the warmest decade, and 1998 
the warmest year, in at least a millen- 
nium.” “That’s plausible,” said 

Cuffey. “We don’t know if it’s 
true or not.” A year or a decade is 
just too short an interval for 
comparison to the older pale- 
otemperature record, he said. 

Whether 2:1 odds for a mil- 
lennial record are good or poor 
turns out to be in the eyes of the 
beholder. Long-standing critics saw 
the report confirming that the hockey 
stick had not stood up to scrutiny; defenders 
saw support for key findings. The committee, 
for its part, stressed that the hockey stick and 
other records resembling it are not the only evi- 
dence of human-induced warming, “and they 
are not the primary evidence.” Cuffey, for one, 
argued staunchly that the case for anthro- 
pogenic global warming is compelling, with or 

without the hockey stick. 
—-RICHARD A. KERR 
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Journal Letter Spotlights China’s Bird Flu Reporting 


A bizarre episode surrounding the publication 
of a letter in this week’s New England Journal 
of Medicine (NEJM) has again focused atten- 
tion on China’s willingness to share public 
health information. 

The letter details the case of a 24-year-old 
Chinese man who died in November 2003. 
SARS was initially suspected, the Chinese 
authors report, but tests for the SARS virus 
were negative. Subsequent tests on the 
patient’s stored tissue samples turned up the 
H5N1 avian influenza virus, the letter states. 
The death occurred 3 months before China 
officially reported any H5N1 outbreaks in 
poultry and 2 years before it officially reported 
its first human case. 

The NEJM letter might not have made such 
a splash were it not for a last-minute attempt to 
retract it. On 21 June, the day before publica- 
tion, NEJM received e-mails purportedly from 
corresponding author Wu-Chun Cao at the 
State Key Laboratory of Pathogens and Bio- 
security in Beijing requesting that the article 
be withdrawn. Because the issue had already 
been printed, NEJM editors sent an e-mail to 
the journal’s media subscribers and posted a 
note on its Web page advising that the letter 
had been retracted. Then on 23 June, NEJM 
sent out another announcement saying that 
Cao had contacted the journal by phone and 
fax and claimed that he had not sent the 
e-mails and had not requested that the report 
be withdrawn. “And so it stands as published 
in the issue of June 22,” reads the e-mail from 
Jeffrey Drazen, NEJM editor-in-chief. The 
episode prompted speculation about govern- 
ment censorship, but the interpretation 
remains murky. Science could not reach Cao or 
his co-authors for comment. 

Masato Tashiro, director of the World 
Health Organization (WHO) Collaborative 
Center for Influenza Surveillance and Research 
at Japan’s National Institute of Infectious Dis- 
eases, says the initial suspicions about SARS 
described in the report are understandable. 
“Clinically, SARS looks just like avian 
influenza,” he says. Nor would it be surprising 
if cases of bird flu were missed among the thou- 
sands of patients presenting with flu symptoms 
and pneumonia in China each year, especially 
in the early days of the outbreak, he says. 

Flu experts have long suspected that HSN1 
was circulating either undetected or unreported 
in southern China, probably since the first out- 


break of the disease in Hong Kong in 1997. 
Among other evidence, two members of a 
Hong Kong family tested positive for HSN1 
after a trip to the mainland’s Fujian Province in 
February 2003, and bird flu is suspected in a 
third member of that family who died in Fujian. 


Family tragedy. Survivi 
extended Ginting family, which los’ 
to bird flu, gather to mourn. 
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Human Transmission But No 
Pandemic in Indonesia 


Bird flu experts meeting in Jakarta last week 
concluded that a rare instance of human-to- 
human-to-human transmission had indeed 
occurred within a large family cluster in 
Indonesia. But they said there is no sign that 
the virus is becoming more dangerous and also 
discounted criticism of the government's han- 
dling of the cluster, which occurred in rural 
northern Sumatra in May. 

With 51 human cases and 39 deaths 
reported so far, Indonesia is the second-hardest 
hit country after Vietnam. Vietnam brought its 
bird flu outbreak under control last year, but the 
number of poultry outbreaks and human cases 
continues to rise in Indonesia. The Sumatra 
family cluster is the largest documented to date. 

Epidemiologic and genetic sequencing data 
suggest that a 10-year-old boy contracted the 
virus from his aunt and then passed it on to his 
father, concluded the experts, who were con- 
vened by the World Health Organization 
(WHO). (Six blood members of the family have 
died of H5N11 infection, and it is suspected in a 
seventh member who was buried before tissue 
samples were collected.) 


Roy Wadia, a spokesperson for WHO in 
Beijing, says the timing of events raises 
questions “about the information-sharing 
mechanism.” Cao’s institute comes under the 
Academy of Military Medical Sciences, and 
his co-authors are affiliated with the 


Although such localized “second genera- 
tion” transmission has never been confirmed 
before, it is not unduly alarming, says Masato 
Tashiro, director of the WHO Collaborative 
Center for Influenza Surveillance and Research 
at Japan's National Institute of Infectious Dis- 
eases. The experts found evidence that the 
virus had mutated slightly as it circulated 
among family members, but the changes 
occurred in a genetic region that does not 
affect transmissibility, he says. 

Steven Bjorge, a WHO epidemiologist in 
Jakarta, defended the country’s handling of the 
cluster, noting that health officials began 
investigating 2 days after family members 
appeared at a private clinic. He admits that the 
country faces an uphill battle in containing 
poultry outbreaks. Progress is being made 
tracking outbreaks in poultry with a pilot sur- 
veillance scheme that involves local officials 
and citizens, says Bjorge. But the country will 
need financial help if it is to extend such a pro- 
gram across Indonesia’s 17,000 inhabited 
islands, which stretch over three time zones. 
Despite pledges made at an international 
donor meeting in Beijing in January, Indone- 
sian officials said last week that not one cent 
had arrived in their country. -D.N. 
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Institute of Microbiology and Epidemiology, 
a People’s Liberation Army hospital, and the 
Beijing Genomics Institute. Wadia believes 
that China’s Ministry of Health was unaware 
of this case until news of the NEJM paper 
started circulating just before its publication. 
WHO has asked the ministry to investigate. 
“There is a public health significance [to the 
timely sharing of information] that can’t be 
stressed too strongly,” says Wadia. 

A prominent Chinese military biologist 
who asked not to be identified says that 


WAR ZONE 


Chinese civilian and military researchers 
often do not share key research results 
because of fears that findings will be 
poached. Although he has no direct knowl- 
edge of the NEM letter, he speculates that “it 
is most likely that the HSN1 patient was hos- 
pitalized in a military hospital”; otherwise, 
the military-affiliated research group would 
never have acquired the tissue samples. 

—DENNIS NORMILE 
With reporting by Gong Yidong and Jia Hepeng in 
Beijing. 


Targeted for Murder, Iraqi Scientists 


Named on a Hit List 


If you want to know how bad it is for scientists 
in Iraq these days, just ask Nazar Al-Anbaky. 
In the spring of 2005, a close friend, agrono- 
mist Awad Esa, director general of the Min- 
istry of Agriculture’s extension division, was 
gunned down by masked men as he was leav- 
ing work. Another colleague, Rafid Abdal 
Alkareem, head of the animal-welfare board, 
fled Iraq after surviving two assassination 
attempts. Faced with persistent threats, the 
ministry last fall dispersed most personnel 
around Baghdad. “I wasn’t able to do my work. 
The danger was everywhere,” says Al-Anbaky, 
who was deputy chief of the ministry’s plant 


Assassinations of Iraqi Academics 


verified by several Iraqi scientists as authentic. 
Last week, rectors of six universities in Spain 
issued a statement warning of “a very grave 
outrage against the cultural and scientific 
development” of Iraq and urging authorities to 
investigate “the killing campaign.” 

For Iraq’s beleaguered scientists, the hit list 
aggravates a desperate situation. Since the 
U.S.-led coalition invaded in April 2003, at 
least 188 Iraqi academics have been slain, 
according to a tally by the Spanish Campaign 
Against Occupation and for the Sovereignty of 
Iraq, based in Madrid. Over the past 3 years, the 
pace has increased (see graph). In that period, 
some 220 doctors have been killed 
and more than 1000 have left Iraq, 


(23) 


the health ministry reported last 
February. Hundreds of scientists 


have fled the country. “This brain 
drain will adversely affect Iraq’s 


development for years to come,” 
says Jafar Jafar, head of Iraq’s 


nuclear program under Saddam 
Hussein. Jafar, general manager of 
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Trimesters 


Mounting toll. The murder rate of Iraqi academics has risen 


steadily since the April 2003 invasion. 


protection research department. So in March, 
he too quit Iraq. 

For months, Iraqi academics have de- 
nounced what they view as an unspoken cam- 
paign to cripple the country’s intellectual elite 
(Science, 30 September 2005, p. 2156). Now 
they face an overt new threat. An unidentified 
group is circulating a hit list of 461 Iraqi intel- 
lectuals. The existence of leaflets calling for 
the assassination of named individuals was 
reported by the newspaper Az-Zaman last 
month; Science has obtained a copy of the list, 


www.sciencemag.org SCIENCE VOL312 30 JUNE 2006 


Uruk Engineering Services in 
Dubai, says he has helped “many 
friends and acquaintances” find 
jobs elsewhere. 

The killers are largely unknown. 
Some murders are sectarian: Sunni 
militias targeting Shiite academics 
and vice versa. Overall, however, the assassina- 
tions “do not follow any religious or sectarian 
pattern,” says Ismail Jalili, an ophthalmic 
surgeon who presented an in-depth analysis at a 
conference in Madrid last April. 

In some cases, money is a motive. One 
recent victim was Ali Hassan Mahawish, dean 
of engineering at Al-Mustansiriya University 
in Baghdad, who told Science last September 
how several professors in his department had 
gone overseas on sabbatical, depleting the 
faculty. He was seized by gunmen in March. > 
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Denice Denton (1959-2006) 


Denice Denton, chancellor of the University of 
California (UC), Santa Cruz, and a champion 
of diversity in science and engineering, 
jumped to her death from a San Francisco 
apartment building on 24 June. 

An electrical engi- 
neer, Denton became 
the first female dean 
of engineering at a 
major research univer- 
sity when she came to 
the University of 
Washington (UW), 
Seattle, in 1996. At 
UW, she established 
programs to bring 
more women and 
minorities into engineering and introduced 
policies to enable female faculty members to 
balance work and family. “She had the ability to 
make everyone feel included,” says Eve Riskin, 
an electrical engineering professor at UW. 

In February 2005, she moved to UC Santa 
Cruz, where she was criticized for helping her 
partner, materials scientist Gretchen Kalonji, get 
a UC administrative job and for the $600,000 
spent on renovations to her campus home. 

Last fall, Denton expressed frustration 
about her job during a meeting on women 
and science at the U.S. National Academies. 
“It's lonely at the top. No one has on their list 
of things to do, ‘Be nice to the dean or the 
provost today. [Ask yourself] what can | do to 
support them in their endeavors for social jus- 
tice?’ " A colleague said Denton confided on 
3 June that she was “very demoralized” and 
“didn't know how much more she could take.” 

“Denice was an accomplished and pas- 
sionate scholar whose life and work demon- 
strated a deep commitment to public service 
and to improving opportunity for the dis- 
advantaged and underrepresented,” says 
UC President Robert Dynes. An interim chan- 
cellor is expected to be appointed soon. 

-YUDHIJIT BHATTACHARJEE 


A Taste for Variety 


MELBOURNE, AUSTRALIA—The Human 
Variome Project—a planned database of all 
variant forms of human genes and their phe- 
notypes—got off the ground here last week. 
Fifty-five international experts met to lay out 
a framework and name geneticist Richard 
Cotton their chief. Cotton says the project, 
which aims to cull global mutations data from 
databases and medical records to understand 
disease, needs $60 million. 

—-ELIZABETH FINKEL 
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“The ransom was paid, but his family got a 
dead body,” says a colleague in Baghdad who 
asked to remain anonymous. The latest drama 
involves petroleum scientist Muthna 
Al-Badery, a top official in the Oil Ministry, 
who was kidnapped earlier this month. 
“Bargaining is still continuing for his life,” the 
Baghdad scientist says. 

The hit list includes scientists, university 


ETHICS 


officials, engineers, doctors, and journalists in 
Baghdad and other cities. “The list is part of an 
organized, foreign-backed campaign to terror- 
ize Iraqi brains,” an official with the Iraqi 
Writers Union told Az-Zaman. No one con- 
tacted by Science knows who issued the list. 
One prominent scientist with ties to Iraq’s 
intelligence community says that Iraqi investi- 
gators are probing claims that Iranian intelli- 


Blocking a Book, Dutch University 
Rekindles Furor Over Nobelist Debye 


A controversy about the alleged Nazi sympa- 
thies of Dutch chemistry Nobel laureate Peter 
Debye has escalated. Utrecht University last 
week halted publication of a pro-Debye book 
by an employee and ordered staff not to dis- 
cuss the issue with the press. The move follows 
a university decision last February 
to strip Debye’s name from its 
institute for nanomaterials. 

A science historian, mean- 
while, has spoken out in Debye’s 
defense, as has another Dutch 
Nobel laureate, Martinus Veltman. 
Cornell University, where Debye 
was a professor from 1940 until his 
death in 1966, has concluded from 
its own 3-month investigation that 
there’s no reason to distance itself 
from him, as has the American 
Chemical Society (ACS). 

The flap erupted after the publi- 
cation of a harsh view of Debye—a 
physical chemist who led the 
Kaiser Wilhelm Institute for 
Physics in Berlin from 1935 to 
1939—in Einstein in Nederland: 
Een intellectuele biografie by 
Berlin-based journalist and sci- 
ence historian Sybe Rispens. One 
chapter, excerpted in a weekly 
magazine, documented that 
Debye, as president of the German 
Physical Society (DPG), asked 
Jewish members to resign ina 
1938 letter signed “Heil Hitler!” It also 
claimed that Debye stayed in touch with Ger- 
man authorities while at Cornell, even offering 
to return to Berlin in June 1941. 

In a brief statement issued on 16 February, 
Utrecht University’s board said it would 
rename the Debye Institute, and Maastricht 
University said it would no longer award the 
Peter Debye Prize (Science, 3 March, p. 1239). 
Gijs van Ginkel, managing director of the 
“former Debye Institute,” as it now calls itself, 
responded by writing a book containing an 


. 


Counter-counterattack. Utrecht University has 
halted the publication of a book that countered 
allegations about Peter Debye published in an 
earlier book (inset). 


analysis of historical documents, his view of 
the affair, and a sharp attack on Rispens. 

But the university has halted its publica- 
tion. Van Ginkel referred questions to univer- 
sity spokesperson Ludo Koks, who denies that 
academic freedom is at stake; Koks says Van 


Ginkel had broken an agree- 
ment not to include personal comments in the 
publication. Koks confirms that institute 
staffers have been ordered not to talk to the 
press to “streamline communications.” 

Mark Walker, a historian at Union College 
in Schenectady, New York, who studies sci- 
ence and technology in the Nazi era, says that 
although Debye “didn’t show civic courage, ... 
all the evidence is that he was not a Nazi 
sympathizer.” For example, the DPG purged 
its Jewish members much later than most other 


gence agents are involved. The U.S. Embassy 
was not aware of the list, says spokesperson 
Dennis Culkin. 

One thing is certain: The campaign has 
cast a pall over Iraqi academia. Says one 
engineering professor who is sticking it out 
in Baghdad, “We carry our coffin every day 
we go to work.” 

—-RICHARD STONE 


scientific societies did, and without any enthu- 
siasm whatsoever, he says. Signing official let- 
ters with “Heil Hitler!” was nothing unusual, 
even among those openly opposed to the 
regime. That Debye tried to keep communica- 
tion channels to Germany open while at Cor- 
nell is also “absolutely reasonable,” Walker 
says, because his daughter still lived there. 

Walker recently gave a lecture about 
Debye at Cornell, where the affair “was 
something we just couldn’t ignore,” says 
Héctor Abrufia, chair of the Department of 
Chemistry and Chemical Biology. “Debye 
has had such a huge influence here.” Ina 
1000-word letter submitted for publication to 
Chemical and Engineering News, Abrufia 
says a review shows that removing Debye’s 
name from a professorship and a lecture 
series would be “unwarranted.” Banning 
books is “not what universities should be 
about,” Abruna adds. 

ACS sees “no compelling reason to do any- 
thing” about its Peter Debye Award in Physical 
Chemistry either, says Gordon McCarty, chair of 
ACS’s Committee on Grants and Awards; 

DuPont, the awards 

sponsor, is “quite com- 

fortable” with that 
stance, he adds. 

Rispens says he 
opposes silencing dif- 
ferent views on Debye 
and would welcome a 
study that went beyond 
his own focus on Albert 
Einstein’s circle. But the 
affair has cost Rispens 
the support of one enthu- 
siastic fan: Veltman, 
who, in a foreword to 
Rispens’s book, praised it 
as “a nugget of gold.” Ina 
5 May open letter to Debye Institute staff, 
Veltman says he took Rispens’s assertions “at 
face value” at the time but now realizes “they 
should be assigned to the realm of fables.” 
The foreword will not appear in new editions 
or translations of the book, Veltman contin- 
ued; the two universities “should admit their 
error, revoke their decision, and forget the 

matter,” he says. 
—MARTIN ENSERINK 


30 JUNE 2006 VOL312 SCIENCE www.sciencemag.org 


Published by AAAS 


CREDIT (LEFT): AP PHOTO 


CREDITS (TOP TO BOTTOM): B. EYMANN, COURTESY OF THE FRENCH ACADEMY OF SCIENCES, Y. O'BANNON, COURTESY OF THE INTERACADEMY COUNCIL 


DIVERSITY 


Report Urges National Academies 
To Improve Status of Women 


Johanna (Anneke) Levelt Sengers stands at the 
top of her profession but confesses that “it can 
be a little lonely” as one of only two women in 
the 82-member engineering sciences section of 
the U.S. National Academy of Sciences (NAS). 
A scientist emeritus at the National Institute of 


Standards and Technology, she 
belongs to both NAS and its partner, 
the National Academy of Engineer- 
ing, where she’s one of seven 
women within the 173-member 
chemical engineering section. So in 
late 2004, when she was asked to 
co-chair an international panel on 
women in science with Manju 
Sharma of India, they decided to 
examine not just women’s place in 
society but also their status within 
the 90 national academies that had 
requested the report. 

The report, posted last week by 
the InterAcademy Council (IAC) 
(www.interacademycouncil.net), 
offers a refreshingly candid assess- 
ment of the problems facing women 
trying to enter and move up in the 
world of science and engineering. 
Although it strikes familiar chords 
about the need to remove barriers 


and increase opportunities for girls and women, it 
sings a new tune in commanding the national 
academies themselves to “first put their own 
houses in order.” In addition to choosing more 
women as members and leaders of their organiza- 
tions, each national academy should form a stand- 
ing committee on diversity to gather and discuss 


gender-related data, it says. 


“Wow. This is far more hard-hitting and to 
the point than I had expected,” says Donna 
Dean, president of the Association for Women 
in Science in Washington, D.C., and a former 
senior administrator at the National Institutes 
of Health, who is now at the Washington, 
D.C., science-lobbying firm of Lewis-Burke 
Associates. “It tells the various academies to 
stop pontificating about the right thing to do 
and start showing it in how they operate.” 


* SENGERS 


Against the odds. Levelt Sengers 
helped write a report on women for an 
international council of national acade- 
mies whose 15-member board (shown 
below in January 2006) is all male. 


73 


there’s 


The report was funded in part by a $50,000 
grant from LOreal. Since 1998, the France-based 
cosmetics company has honored outstanding 
women scientists around the world—including 
five of the eight women on the 10-person [AC 
panel. Jennifer Campbell, who heads the 


company’s philan- 
thropic efforts, says 
she would like to see 
across-the-board parity 
for women in science. 
But Levelt Sengers says 
she thinks that “a 
reasonable goal would 
be no major disparity 
between the percentage 
of Ph.D. degrees awarded 
to women in a particu- 
lar field and the per- 
centage elected in that 
field”? Most academies 
are a far cry from 
reaching even that 
level; the 2% figure for 
NAS women in chemi- 
cal engineering, for 
example, pales beside 
the 14% of U.S. Ph.D.s 
awarded in the 1980s, 


much less the 22% awarded in the 1990s. 

NAS President Ralph Cicerone says that 
no magic bullet” for adding women to 
the academy’s ranks but that NAS is trying to 
increase their chances of gaining the type of 
recognition—through service on academy pan- 
els, keynote speeches, and major scientific 


awards—that traditionally leads to NAS mem- 


bership. NAS has no plans “to collapse its 
activities into one committee on gender issues,” 
he says, adding that the challenge calls for 
“a sustained effort ... along the entire pipeline.” 

Levelt Sengers says that each academy 
must come up with individual remedies, which 
she hopes will be discussed during the coun- 
cil’s next meeting in December in Cairo, 
Egypt. Dean suggests a radical approach to 
staffing the academies, many of which operate 


extensive networks of 
institutes and labora- 
tories. “What about 
strategic buyouts to 
senior managers, like 
companies do?” she 
asks. “It wouldn’t be 
easy or welcome. But 
business as usual just 
won't get you there.” 
—JEFFREY MERVIS 
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Discovery Carries Heavy Load 


A successful trip for space shuttle Discovery, 
set for launch this weekend, will boost 
prospects for repairing the Hubble Space 
Telescope and completing the half-built 
international space station. Despite concerns 
by safety officers, NASA approved the second 
shuttle mission since February 2003, when 
Columbia disintegrated upon return. Even if 
all goes smoothly, the agency said last week, 
it plans to scale back U.S. research aboard 
the station by eliminating a centrifuge and 
other scientific equipment. 

-ANDREW LAWLER 


Get Your Shots 


Thirty British doctors this week called for 
responsible media coverage amid published 
doubts about the measles, mumps, and rubella 
(MMR) vaccine. Concerns over MMR have 
accompanied a decline in the number of vacci- 
nated U.K. children from 93% in 1995 to 
83% in 2005, and the doctors cite a “dramatic” 
rise of measles this year as well as a fatality, the 
first in 14 years. To blame, they say, is a 1998 
Lancet paper linking the jab to autism by 
Andrew Wakefield, who was charged with mis- 
conduct this month by the U.K.’s General Med- 
ical Council. “Illness or death” could befall 
unimmunized children, the signatories warn. 
-LAURA BLACKBURN 


Patently Obvious? Ask 
The Supremes 


Is the U.S. government granting patents for 
inventions that are obvious? This week, the 
U.S. Supreme Court accepted a case that 
biotech attorneys say could make new patents 
harder to obtain. KSR International v. Teleflex 
involves a dispute over a gas pedal, but the 
case has “tremendous implications for biotech,” 
says Hal Wegner of Foley & Lardner LLP in Wash- 
ington, D.C. That’s because the court could 
toughen a standard used to determine the 
validity of an application that combines ele- 
ments of published ideas or patents. The stan- 
dard is whether the published work includes a 
specific “suggestion” to combine existing parts. 
A 2004 National Academies panel called for 

“a stricter standard” to improve biotech patent 
quality. Arguments are set for autumn in the 
case, which is expected to pit the software indus- 
try against the biotech and pharma sectors. 

In other patent news, last week the justices 
decided not to act on a case they had heard 
involving whether scientific information can 
be patented (Science, 17 February, p. 946). 

—-ELI KINTISCH 
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PALEOCLIMATOLOGY 


Atlantic Mud Shows How Melting 
Ice Triggered an Ancient Chill 


Eighty-two hundred years ago, a chill swept 
around the Northern Hemisphere, the last, fee- 
ble gasp of the mighty 100,000-year ice age that 
preceded it. Geologists looking for a cause had 
glimpsed evidence that a vast outpouring of gla- 
cial meltwater had gushed into the Atlantic 
Ocean less than a year after an ice dam busted. 
That meltwater outburst, seven times as volu- 
minous as all five present-day Great Lakes, had 
come suspiciously close to the 8200-year, or 
“8K,” chill. But paleoceanographers couldn’t 
answer the big question: How could it have 
affected deep-sea currents believed to play a 
pivotal role in controlling climate? 

Now, paleoceanographers report on page 
1929 that they have found a single ocean sedi- 
ment core that preserves the sought-for link. 
An ocean current, the so-called conveyor, car- 
ries climate-moderating heat into the far north, 
where it sinks to the bottom and heads south. 
The core recorded both a gush of freshwater 
far out into the North Atlantic and the nearly 
simultaneous slowing of that conveyor. “I’m a 
believer” in the meltwater-conveyor-climate 
link, says paleoceanographer Nicholas 
McCave of Cambridge University in the U.K., 
who is not an author of the paper. 

The core came from south of Iceland on the 
Gardar Drift, where muddy sediment collects 
10 to 20 times faster than is usual in the deep 
sea. With more sediment per year to work with, 
paleoceanographers Christopher Ellison and 
Mark Chapman of the University of East Anglia 
and Ian Hall of Cardiff University, both in the 
U.K., could sample smaller bits of time and thus 
read a more detailed history. To gauge the 
changing temperature of surface waters, they 
measured the abundance of a cold-loving 
plankton species. They calculated the salinity of 
the seawater from the ratios of oxygen isotopes 
in the plankton’s shells after adjusting for the 
effect of temperature. Finally, they inferred the 
speed of ancient currents from the varying size 
of silt particles in the mud. The more abundant 
the larger particles were, the faster the current 
was moving across the bottom. 

By reading the history of both surface and 
bottom waters in a single core, the U.K. 
researchers nailed down the order of events more 
than 8000 years ago. They found that the cold 
meltwater came in two pulses, the first about 
8490 years ago, the second 200 years later. Late in 
the first freshening, the returning conveyor cur- 
rent sweeping southward along the Gardar Drift 
began to slow. Apparently, the fresher, more buoy- 
ant surface waters slowed the sinking of the con- 
veyor’s waters into the deep sea at the current’s 


turnaround point. The second freshening repeated 
the pattern and further slowed the conveyor. 

The Gardar results are “a strong confirma- 
tion that this was a freshwater event,” says geo- 
chemist Wallace Broecker of Lamont-Doherty 
Earth Observatory in Palisades, New York. 
That’s reassuring, Broecker says, because it 
helps clear up a vexing puzzle about an earlier, 
even more drastic cooling: the 11,000-year-old 
Younger Dryas cold spell, also supposedly due 
to a glacial outburst. Broecker and geologist 
colleagues reported recently that they couldn’t 
find the route of the meltwater on land. That 
failure raised the troubling possibility that gla- 
cial meltwater had nothing to do with the 
Younger Dryas, the 8K event, or other abrupt 
coolings of the past 15,000 years. Now it looks 
as if theorists were on the right track after all. 

Next, researchers need to figure out why the 
far smaller meltwater release of the Younger 
Dryas triggered a cooling so much greater than 
the 8K’s. “The 8K was the biggest flood,” says 
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glaciologist Richard Alley of Pennsylvania 
State University in State College, “but it’s the 
one that didn’t stick,” climatically speaking. 
Had the climate system by then developed some 
protective property that warded off the abrupt 
chill? If so, will the present climate be able to do 
the same as the greenhouse sends more fresh- 
water—rain and Greenland meltwater—into 
the North Atlantic? —-RICHARD A. KERR 


Signs of Empathy Seen in Mice 


Empathy is one of the nobler human attrib- 
utes, which may explain why we’re often 
reluctant to ascribe it to other animals. A 
debate has simmered for years about whether 
chimps display empathy, for example. Now on 
page 1967, scientists argue that even lowly 
mice have a rudimentary form of it. 

The research team, led by Jeffrey Mogil at 
McGill University in Montreal, Canada, reports 
that mice become more sensitive to pain when 
they see a familiar mouse in pain. 

That probably doesn’t qualify 
as empathy as the word is 
understood in everyday 
conversation, says ethol- 
ogist Frans de Waal of 
Emory University in 


Commiserating mice. 

Observing a cagemate can 
influence pain sensitivity 
in mice. 


Atlanta, Georgia. Still, de Waal ———- 


and others say it does suggest that mice 
have some ability to sense what their fellow 
rodents are experiencing. “They’re in tune with 
each other,” de Waal says. 

In one experiment, Mogil and colleagues 
injected mice in the belly with a weak acetic 


acid solution. Solitary mice react to the injec- 
tion with a stereotyped writhing behavior, 
stretching repeatedly and extending their back 
legs. Mice spent more time writhing when the 
researchers placed them in a Plexiglas cylinder 
with a cagemate—a mouse they’d lived with 
for at least a week—given the same injection at 
the same time. When the researchers paired 
mice who'd never met, however, no significant 
increase in writhing occurred. 

The researchers also injected cagemates 
with formalin in one paw, causing them 
to lick the presumably painful area. 
In some cases, both mice received 

the same concentration of forma- 
lin, either a low dose or a high 
dose. Not surprisingly, pairs of 
mice given the high dose spent 
more time licking their paws 
than did pairs given the low dose. 
But when a mouse given the low 
dose was paired with a cagemate 
given the high dose, it licked more, on 
average, than if it had been paired with 
another low-dose mouse. More importantly, 
the high-dose mouse licked less, on average, 
than did a high-dose mouse paired with a cage- 
mate that also got a high dose. Observation, it 
seems, can reduce pain behavior as well as 
enhance it, Mogil says. > 
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Finally, the team repeated the acetic acid 
experiment and incorporated a different gauge 
of pain sensitivity, measuring the time it took for 
a mouse to withdraw its paw from a hot spot on 
the floor of the test cylinder. When observing a 
cagemate writhing from an acetic acid injection, 
mice withdraw their feet from the heat more 
quickly—even if they’d received no injection 
themselves. That’s the most important experi- 
ment, Mogil says, because it indicates that the 
mice aren’t simply imitating what they see the 
other mouse doing. “It suggests the pain system 
is being sensitized in a general manner” by see- 
ing a cagemate in pain. 

Is that empathy? It depends on whom you 
ask, says Tania Singer, a cognitive neuroscien- 
tist at the University of Zurich in Switzerland 


CHEMISTRY 


Sugary Recipe Boosts Grow-Your-Own 


Motorists aren’t alone in feeling the pain of ris- 
ing oil prices. Some commodity chemicals, such 
as the polypropylene that’s found in everything 
from textiles to dashboards, have tripled in price 
in the past few years. That surge has spurred new 
interest in the once-sleepy field of 
converting crops and other renew- 
able feedstocks into commodity 
chemicals. Chemical companies 
have made progress in recent 
years. But up to now, it has been 
difficult and costly to make the 
kinds of compounds that serve as 
starting materials for most oil- 
derived chemicals. Now, work by 
researchers in the United States 
may give plant-derived chemicals 
anew push. 

On page 1933, University of 
Wisconsin, Madison, chemical 
engineer James Dumesic and 
colleagues report a new process 
for turning fructose, the sugar 
in fruit, into a compound called 
5-hydroxymethylfurfural 
(HMFP), which can replace key petroleum- 
derived chemical building blocks. Unlike 
previous schemes for turning sugar into 
HM, the new process is efficient, easy, and 
potentially low cost. “It looks real good to 
me,” says Thomas Zawodzinski, a chemical 
engineer at Case Western Reserve University 
in Cleveland, Ohio. “This is the direction 
things need to go in.” 

Because of its current high cost, HMF 
isn’t produced commercially in large vol- 
umes. But it is easily converted into other 
compounds, such as one abbreviated FDCA 
that can serve as the starting material for 
commodity chemicals such as polyesters. On 


who has studied pain and empathy in people 
(Science, 20 February 2004, p. 1121). “Philoso- 
phers would argue you can only have empathy if 
you have consciousness,” she explains. 
“Psychologists would want to see evidence of 
altruistic behavior and altruistic motivation.” 
Mice probably don’t meet those criteria, she says. 

And not everyone agrees that Mogil’s experi- 
ments actually address the issue of empathy in the 
first place. Writhing and paw licking are reflexive 
behaviors mediated by the spinal cord, notes 
Charles Vierck, a neurobiologist at the University 
of Florida, Gainesville. “So what we have here is 
modulation of a reflex response during observa- 
tion ... of the reflex responses of other animals.” 
And that, says Vierck, is nothing new. 

Still, Singer and others, including Mogil, 


paper, converting fructose, a small, ring- 
shaped sugar, to HMF is simple. It 
requires stripping off what amounts to 
three water molecules. Researchers have 
developed numerous acid-based catalysts that 


Sweet prospects. Sugars from fruits and grains could replace petroleum as the 
basis for commodity chemicals such as plastics, if a new process pans out. 


do that quickly. Once formed, however, HMF 
readily reacts with fructose, other intermedi- 
ate reactants in the mix, and even itself to 
form a chemical zoo of unwanted byproducts, 
which sharply limits the amount of HMF 
that’s recovered at the end. In hopes of solving 
that problem, researchers have tried adding an 
organic solvent that sits atop the water like oil 
in salad dressing. HMF’s affinity for the sol- 
vent spirits it out of the aqueous phase, during 
which the unwanted side reactions take place. 
Unfortunately, separating HMF from the sol- 
vents proved difficult. Researchers had to boil 
away the solvents at very high temperatures. 
To improve the process, Dumesic and his 
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interpret the findings as evidence that mice have 
“emotional contagion,” a primitive kind of 
empathy. “Emotional contagion means one 
baby starts crying and all the babies start cry- 
ing,” explains Peggy Mason, a neurobiologist at 
the University of Chicago who studies pain. 
Unlike higher forms of empathy, it doesn’t 
require understanding what others are experi- 
encing. “The second baby doesn’t have to real- 
ize that the first baby is upset because it has a 
dirty diaper,’ notes Mason. 

Many researchers see emotional contagion 
as a steppingstone toward the more sophisti- 
cated kind of empathy that evolved in 
humans. “To imagine that empathy just 
started de novo in primates seems biologically 
implausible,” says Mason. -GREG MILLER 


Plastics 


students Yuriy Roman-Leshkov and Juben 
Chheda had to solve these and other problems 
simultaneously. They did so by adding a series 
of different compounds both during the aque- 
ous phase, in which catalysts convert fructose 
into HME, and to the solvent. In the 
aqueous phase, the additives— 
abbreviated DMSO and PVP— 
suppressed side reactions, thereby 
encouraging HMF production. 
Unfortunately, they also increased 
HMF’s solubility in water, making it 
harder for the solvent, known as 
MIBK, to remove HMF from the 
aqueous phase before it could react 
further. The Wisconsin team over- 
came that obstacle by spiking the 
MIBK with a dash of a compound 
called 2-butanol, which increased 
HMF’s affinity for the solvent. 
Finally, because MIBK has a low 
boiling point, the Wisconsin team 
could easily evaporate it along with 
the 2-butanol, recover the HMF, and 
return the solvents to the reactor. 
The changes doubled the percentage of 
fructose that gets converted into HME, to 85%. 
With that boost and related improvements, 
“now you can make some pretty compelling 
arguments” for producing HMF commer- 
cially, says Todd Werpy, an expert on produc- 
ing bio-derived chemicals at Pacific North- 
west National Laboratory in Richland, Wash- 
ington. Producing commodity chemicals from 
renewable feedstocks “‘is really in its infancy,” 
Werpy says. But with top research groups now 
training their sights on the problem, he adds, 
“renewables could make a major contribution 
to the chemical needs in the United States.” 
—ROBERT F. SERVICE 
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every mouse gene struggles 
to get its act together 


A Mouse for Every Gene 


IN ADRIANO AGUZZI’S EXPERIENCE, 
getting hold of a new mouse strain can be 
nothing but trouble. A neuropathologist at the 
University Hospital of Zurich in Switzerland, 
he is one of thousands of researchers who 
study mutant mice for clues to what particular 
genes do. “Once I requested a mouse, and the 
guy wanted everyone from himself to his 
grandmother to be a co-author on everything 
we published with that mouse,” 
says Aguzzi. “It was like sci- 
entific prostitution.” Another 
time, he says, a researcher 
promised him a mouse but 
took more than a year to 
deliver: “[The investigator] 
should have just said his cat 
ate it; it would have saved us a 
lot of trouble.” 

Most mouse researchers can 
tell similar horror stories. But 
help is on the way. Several large- 
scale projects plan to disable 
every gene in the mouse genome 
and make the resulting mice 
readily available to the research 
public. In January, Europe and 
Canada embarked on ambitious 
efforts that together will produce 
more than 30,000 knockouts. 
And this summer, the U.S. National Institutes of 
Health (NIH) will announce the Knockout 
Mouse Project (KOMP), which will add another 
10,000 to the list. China, too, is gearing up to 
make 100,000 mutants, with the goal of making 
20,000 lines of mice, each with a different gene 
knocked out. (see sidebar, p. 1864). All told, 
these efforts will cost almost $100 million. 
Although separate entities, “the plan is to have 


every center work together, much like [what] 
was done with the Human Genome Project,” 
says Allan Bradley, director of the Wellcome 
Trust Sanger Institute in Cambridge, U.K., 
which is part of the European effort. 

Indeed, overall, the knockout effort is arguably 
the largest international biological research 
endeavor since the Human Genome Project. And 
it is the next major step in figuring out what 


Holy Grail? Marina Picciotto would love to find a mouse that caves to peer pressure, 
but chances are it’s hidden away or hasn't been made yet. 


makes us tick. The human and mouse genome 
projects each identified some 25,000 genes, most 
quite similar between the two species. But 
researchers have no idea what more than half of 
these genes do. Because the mouse is so amenable 
to genetic manipulation, and so well studied, 
mass-produced mutant mice offer a window into 
these unknown genes. “The Human Genome 
Project wasn’t done just to get the sequence,” 


says Christopher Austin, director of the NIH 
Chemical Genomics Center and KOMP’s 
founding father. “It was a prerequisite for figuring 
out what our genes do.” 

How the individual mass-knockout projects 
will work together is still being ironed out. Each 
project is embarking on a different—and not 
necessarily compatible—approach to making 
its mutant mice, and the logistics of keeping 
track of all the mutants made 
are daunting. In addition, each 
effort will need to work out 
an efficient way to catalog and 
distribute the mice it creates. 
They will also have to deal with 
intellectual-property claims 
when one of the new mutants 
turns out to be a previously 
patented mouse strain. “The 
mouse project could open up 
huge areas of science, just like 
the Human Genome Project 
did,” says Marina Picciotto, a 
molecular neurobiologist at Yale 
University, “but there are likely 
to be hiccups along the way.” 

Although Picciotto and most 
of her colleagues are optimistic 
about mass-produced knock- 
outs, some wonder whether the 
efforts are the best use of public resources. 
Knocking out genes is really just the beginning. 
Those tens of thousands of mutant mice won’t 
do many researchers much good until the 
behavior, morphology, and physiology of these 
knockouts have been described. Characterizing 
each mouse will not be easy. “You can knock 
out every gene, but if you don’t have assays to 
evaluate them, it’s hard to figure out what the 
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Buyer beware. Deactivating the same gene in 
Black 6 (left) and 129 mice may yield widely 
different phenotypes. 


gene is doing,” says Marnie Halpern, a zebrafish 
geneticist at the Carnegie Institution of 
Washington in Baltimore, Maryland. 


Hiding out 

As a group, the knockout projects are trying to 
create something akin to the international super- 
store IKEA, where, in a single trip, customers 
can buy a houseful of easy-to-assemble furniture 
at reasonable prices. In this case, however, 
researchers wouldn’t even have to make a trip 
to the store. Ideally, they would simply go to a 
central database and click their own computer 
mouse to order the knockout mouse of their 
choice. Within weeks, frozen embryos would 
arrive at their door. Like IKEA, some assembly 
would be required: turning those frozen embryos 
into live mice. But that requirement is minimal 
compared to the tens of thousands of dollars and 
a year or more of work involved in creating an 
average knockout mouse. 

Such a resource would be a far cry from 
today’s mouse trade, which is more like buying 
furniture from neighbors. Selection is limited, 
quality varies, and some items just aren’t for 
sale. Part of the problem, says Francis Collins, 
director of NIH’s National Human Genome 
Research Institute in Bethesda, Maryland, is 
that until recently, researchers often didn’t know 
what the lab down the street—let alone one in 
another country—was doing. Investigators 
aren’t required to place their mice in public 
repositories, and some never write up knockouts 
they don’t find useful. 

To remedy this situation, NIH went on a 
mouse hunt. It started its inquiry at the Jackson 
Laboratory (JAX) in Bar Harbor, Maine. JAX 
stores more than 800 varieties of mutants and 
maintains a database of every published mouse 
knockout. Then NIH went door-to-door, 
publishing a request asking investigators go to 
a JAX Web site and list any knockouts they had 
created and were willing to share with the 
research public. 

The findings were dispiriting. All told, the 
mouse community had knocked out about 
11,000 genes, but many labs were repeating 
work done elsewhere. More than 700 knock- 
outs had been created three times or more; in 
one case, a single mouse had been duplicated 
11 times. And of the 4000 unique knockouts 
that have been published, more than 3000 are 
not in public repositories, meaning most are 
either unknown or unavailable to the wider 
community. “It’s embarrassing,” says Collins. 
“A graduate student shouldn’t spend a year 
making a knockout that’s already been made. 
It’s not a good use of resources.” 

Yale’s Picciotto is a case in point. Asa 
researcher who studies the genetics of addiction, 
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Out cold: Lexicon is making thousands 
Of mouse knockouts in embryonic stem 
cells. These frozen lines will become part 
of the TIGM resource. 


NIH Knocks Out Key Mouse House 


When the Texas Institute for Genomic Medicine (TIGM) applied to be part of a new $50 million 
U.S. National Institutes of Health (NIH) program to knock out as many mouse genes as possible, 
it seemed to be a shoo-in. Thanks to a partnership with Lexicon Genetics in The Woodlands, Texas, 
TIGM already has in its freezers knockouts for nearly a third of all mouse genes—twice what global 
knockout projects have achieved so far (see main text). “Taking us on would have made it easy for 
[NIH] to fulfill its mission,” says TIGM President Richard Finnell. 

Instead, he says, NIH has rejected his institute’s application, potentially forcing NIH’s Knockout 
Mouse Project (KOMP) to start from scratch and positioning TIGM as a possible competitor. NIH 
won't comment on the move until it announces the winners of the competition later this summer, 
but some in the mouse community feel that Lexicon’s reputation for tough intellectual property 
(IP) restrictions may have hurt TIGM’s chances. Finnell insists, however, that TIGM will place no 
IP restrictions on its knockouts. 

Founded as a nonprofit organization last summer with a $50 million award from the Texas 
Enterprise Fund—a $295 million pot set up by the state to create jobs—TIGM’s mission is 
essentially identical to that of the global knockout effort: Establish a massive mouse-mutant 
resource in embryonic stem cells and distribute these lines to academic scientists at cost. But 
while the global program’s players are just beginning to churn out knockouts, TIGM, which is 
based in Houston and College Station, has lept ahead. 

It has used $30 million of its $50 million to purchase Lexicon’s growing library of knockouts in 
the coveted Black 6 strain of mice; starting this month, researchers can order any of 7500 unique 
knockouts—representing about a third of the mouse genome—and they'll have access to knockouts 
covering more than two-thirds of the genome by late 2007, says Finnell. 

Becoming part of KOMP would not only have helped NIH achieve its goals more quickly and cheaply, 
says Finnell, but it would have also made TIGM’s mouse lines more economical for researchers. Without 
NIH support, TIGM will still be supplying knockouts years before KOMP, says Finnell, although 
researchers may have to pay more for them. (Pricing details are still being worked out.) 

Lexicon CEO Arthur Sands is confounded by NIH’s decision. “It just doesn’t make sense,” he 
says. “[Our] resource is already on the ground.” Neither Sands nor Finnell would speculate on why 
NIH decided not to collaborate with the institute. And outside scientists were hesitant to speak on 
the record. But some researchers Science spoke to said IP restrictions Lexicon has imposed in the 
past—such as requiring labs and universities to sign away certain rights related to discoveries 
made using its mice—have been problematic. Under the TIGM deal, however, those restrictions are 
lifted, says Finnell, “so that wouldn't have been an issue.” 

Others say NIH is interested in more cutting-edge science than Lexicon is using to make its lines. 
Ideally, for example, KOMP centers will use gene-specific targeting technology in addition to random 
gene-trapping technology. According to Finnell, Lexicon’s library is being made almost exclusively by 
means of gene trapping (see figures, p. 1865), although he says that—with NIH funding—TIGM 
would have tried to complete the remaining third of the resource using gene targeting. 

Despite the NIH setback, TIGM is planning to make its mark in the mouse world. “It will cost 
more now, but we're going to get these lines out to researchers,” says Finnell. “When people think 
about knockout mice, they'll think about TIGM.” -D.G. 


she would love to find a mouse that caves to 
peer pressure. So far, she’s managed to make a 
few handy knockouts. Some shun nicotine; others 


dig opiates. One even seems to be operating on 
a natural antidepressant. But for a complete 
picture of the mouse social psyche, Picciotto 
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China Takes Aim at Comprehensive Mouse 
Knockout Program 


SHANGHAI—Geneticist Xiaohui Wu looks through a window into a clean 
room on the campus of Fudan University here and proudly points to a 
growing collection of mutant mice. To a visitor, the 4000 cages and 
20,000 mice representing 400 mutant strains look pretty impressive. To 
Wu, the scale of the operation is a frustrating limitation. 

“We plan to mutate 70% of the mouse genome over the next 5 years,” 
he says. Yet, their current facilities are filled to capacity. A new building will 
provide space for 10,000 more cages. But Wu needs 50,000 more, enough 
for about 100,000 mutant mice. Those cages, he says, require a lot more 
space and “a lot of money.” 

Throughout the world, researchers are setting up programs to shut 
down the mouse genome gene by gene to learn what each gene does (see 
main text). The Fudan University mouse facility—a joint effort with Yale 
University—is shooting to be a key player and hopes to team up with the 
U.S. National Institutes of Health (NIH) Knockout Mouse Project. 

The driving force behind the tentatively named Mammalian Functional 
Genome Project is Tian Xu, a geneticist at Yale University School of Medicine 
who is also an adjunct professor at Fudan. The Fudan-Yale group, along with 
colleagues at the University of Colorado, Boulder, and Duke University in 
Durham, North Carolina, has come up with an efficient way to knock out 
mouse genes. They use a transposon, a short segment of DNA that invades 
genomes, sometimes inserting itself into a gene and deactivating it. 

Developmental biologists have used transposons to disable genes in 
plants, worms, and fruit flies for 
years, but they had not found 
one that worked well in mam- 
mals. After 8 years of searching, 
Xu and his colleagues found 
“piggyBac,” which was first 
identified in the cabbage looper 
moth by molecular virologist 
Malcolm Fraser of the University 
of Notre Dame in Indiana. “We 
don't know why it works,” says 
Xu. But it does. The group 
reported its finding in the 
12 August 2005 issue of Cell. 

The technique is similar to 
gene trapping in that it 
randomly disables genes. But 
using a transposon avoids the 
laborious manipulation of 
embryonic stem cells required by other knockout techniques. The researchers 
made a line of mice that carry both the transposon and DNA that causes the 
transposon to move. When they mate these mice with wildtype mice, the 
transposon hops to a new place, preferably to a gene. “All you need to do is 
just breed mice, and each has different genes mutated,” Xu says. This 


Loyal alum. Yale’s Tian Xu and his alma 
mater are making mutant mice. 


In the pink. UV light reveals 
which mice (hot pink) carry a 
transposon with a fluorescing 
protein gene. 


approach can hit genes other knockout approaches tend to miss, he adds. 

Also, the Fudan-Yale group has put the gene for red fluorescent protein 
into the transposon. Mice that wind up with the transposon in their genomes 
are pink under ultraviolet light. “You just look at it, and you can tell” if the 
genome is carrying the transposon, Xu says. 

The Fudan-Yale team opted to set up its large-scale mouse facility in China 
to save money. Xu estimates that this project could cost one-fifth to one-fourth 
what it would cost in the United States. But itis still not cheap, and international 
researchers are impressed by the $12.5 million already pledged from national 
and local government funding agencies. “I think it’s great that [the Chinese] 
are doing this,” says Phil Soriano, a developmental biologist at the Fred 
Hutchinson Cancer Research Center in Seattle, Washington. Wolfgang Wurst, 
director of the Institute of Developmental Genetics at the National Research 
Center for Environment and Health in Munich, Germany, thinks the project is a 
welcome indication of China’s increasingly international orientation. “It is a 
sign that they are serious research partners,” he says. 

To leverage support from China itself, Xu and Wu are asking for $30 million 
from NIH to start mass-producing, preserving, and distributing mutant mice. 
For the cost of shipping and handling, researchers will receive frozen embryos 
or sperm, with no intellectual-property—rights restrictions attached. Also, the 
NIH money would go a long way toward producing the 100,000 strains of 
transposon-modified mice. Wu and Xu need that number of strains to be sure 
they have 20,000 genes covered, because the transposon also lands on non- 
coding regions. If they don’t get NIH funding, they may have to recoup some 
costs by charging fees or placing restrictions on mutant mice, Wu says. 

At this point, the other programs are simply making knockout strains, 
But here, researchers are busy screening the more than 400 mutant mice 
they have generated over the past year, looking for phenotypes from neuro- 
physiological, immunological, and disease angles, among others. That 
information will go up on the Web prior to publication, making it easier for 
potential users to see which mouse will best suit their needs, the duo point 
out. Four hundred mutants is about the limit until the team’s new facility 
comes on line. After that, the view through these new clean-room windows 
will get even more interesting. —DENNIS NORMILE 


needs an animal that wants drugs just because his 
companions have them. 

Setting out to make her dream mouse is not 
really an option, however, because she has no 
clue what gene might influence peer-pressure 
sensitivity. Picciotto might be able to find the 
mouse in the community after an exhaustive 
search, but, if it exists, there’s a good chance 


it’s tucked away in a cage in a lab somewhere or 
frozen down as a clump of embryonic stem 
cells in a biotech company. Either way, it’s as 
good as gone. 

Even if Picciotto finds what she is looking for, 
that’s hardly the end of the story. “I’m sorry to say 
that there are a few labs out there [that] won’t 
share their mice even if they’ve published them in 


ajournal [such as Science or Nature] that requires 
them to do so,” says M. Celeste Simon, a develop- 
mental and cancer biologist at the University of 
Pennsylvania Cancer Center in Philadelphia. 
And as Aguzzi knows all too well, reticent mouse- 
makers can effectively quash efforts to use their 
mice by stalling delivery or making outrageous 
demands about co-authorship. 
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Gene Trapping 


Only one type of vector needed 


NEWSFOCUS 


Gene Targeting 


Different vector needed for each gene 


Genes in embryonic stem cells disrupted at random 


Adults containing modified cells crossed to produce full knockouts 


Assuming the source of the mouse is coop- 
erative, “transferring mice is an extremely 
difficult and time-consuming process,” says 
Simon. Some of Simon’s Penn colleagues lost 
2 years of work when mice they ordered from a 
government facility turned out to be infected 
with an extremely contagious virus that can 
alter phenotypes. “It strikes fear into one’s 
heart,” she says. “Two years is a lifetime in the 
world of science.” Other investigators complain 
about the cost and hassles of shipping or 
draconian material transfer agreements. 

Over the past 6 years, several efforts have 
popped up to help address some of these prob- 
lems. The International Gene Trap Consortium, 
for example, runs a database that enables 
researchers to track down about 20% of the exist- 
ing unique mouse knockouts. And repositories 
themselves—most of which are publicly funded 
and store anywhere from 500 to 4000 mice—are 
beginning to work together under the Federation 
of International Mouse Resources to help make 
sure researchers around the world can get any 
mouse in any repository. 


Modified embryonic stem cells injected into very early embryo 


' ' 


The big push 
Realizing that these were just baby steps, mouse 
researchers from several countries decided in 
2003 to take a giant leap. At a meeting at the Cold 
Spring Harbor Laboratory in New York, they 
called for a comprehensive international mouse 
knockout program. Besides shooting for an 
IKEA-like superstore, the participants agreed 
that it would be most economical to avoid 
trafficking in live mice and instead decided to 
maintain the knockouts as embryonic stem (ES) 
cells: clumps of tissue that can be frozen down 
and later grown up into full-fledged mice. 
Researchers could request ES cells or be 
provided with easier-to-use frozen embryos or 
sperm. They also proposed to use NIH’s National 
Center for Biotechnology Information as their 
clearinghouse. Its Web site would act as a sort of 
Google to scan mouse repositories for the desired 
knockout. “The ultimate goal is to have one-stop 
shopping [for these mice],” says KOMP Program 
Director Colin Fletcher. 

Two years after the meeting, Wolfgang 
Wurst, director of the Institute of Developmental 


Specific genes targeted 


Modified embryonic stem cells injected into very early embryo 


Adults containing modified cells crossed to produce full knockouts 
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Different strokes. There’s more than one way to 
knock out a mouse, but each has its pros and cons. 


Genetics at the German National Research 
Center for Environment and Health (GSF), and 
his colleagues set up the European Conditional 
Mouse Mutagenesis Program (EUCOMM). To 
get the program rolling, the European Union 
has promised $16.3 million over the next 3 years. 
The bulk of the EUCOMM effort is divided 
between two institutes: GSF and the Sanger 
Institute. GSF will use “gene trapping” (see 
diagram, above left) technology to randomly 
knock out 12,000 genes in ES cells. The Sanger 
Institute and GSF will use “gene targeting” 
technology to disable 8000 preselected genes 
(see diagram, above right). 

“Tt’s an ambitious program,” says Bradley, 
who is leading the Sanger effort, “but we’re 
fairly confident we can meet our goals.” So 
far, GSF has produced about 3700 unique 
knockouts, which researchers can order for 
$631 apiece. Bradley expects Sanger’s lines to 
start becoming available by late 2007. 
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At the same time EUCOMM was getting 
started, Canada came out with the North 
American Conditional Mouse Mutagenesis 
Project (NorCOMM). Over the next 5 years, 
Genome Canada will spend $8 million for 
knockout work primarily at the University of 
Toronto and the University of Manitoba. The 
project has produced 3000 gene-trapped 
knockouts and hopes to make 9000 more over 
the next 18 months. 

NIH’s upcoming knockout effort is similar in 
scope and direction. KOMP expects to spend 
$50 million at up to four soon-to-be-named 
centers to build a library of 10,000 knockouts 
(see sidebar, p. 1863). Like EUCOMM, KOMP 
will likely use a combination of gene trapping 
and gene targeting to produce its knockouts. 
Targeting allows researchers to make precise 
mutations in their gene of choice, says Fletcher, 
and targeting will be easier to coordinate among 
KOMP centers and with the international part- 
ners because each group will know exactly what 
gene it’s going after. 

But there are important differences between 
KOMP and the other programs. EUCOMM and 
NorCOMM are making so-called conditional 
knockouts, in which the genes that are swapped 
into the genome have a self-destruct sequence. 


Also, of all the mouse efforts, only KOMP 
will focus on “repatriation.” Thanks to NIH’s 
detective work, the agency has compiled a list of 
the “lost” mice in the community. Recently, in a 
sort of mouse version of American Idol, NIH 
posted a request asking researchers to vote for 
the top 20 mice on this list that they’d like to see 
in a public repository. “That helped us prioritize 
500 to 600 mice to repatriate,” says Fletcher. 

Part of the KOMP effort will involve con- 
tacting the owners of these mice and asking 
them to put their animals in a globally accessi- 
ble repository. NIH kicked off this program 
earlier this month, with $800,000 split between 
the University of California, Davis, and the 
University of Missouri, Columbia, to acquire 
300 of these lines. KOMP leaders hope the 
repatriation effort will conserve resources by 
obviating the need to make these lines again. 


Trouble ahead? 

But before a global 
knockout mouse 
emporium opens it 
doors, the interna- 
tional effort must 
overcome a number of 
hurdles. Topping the 


Gone, but not completely. Without the Dicer gene, a mouse embryo (inset, left) is small compared to a 
normal embryo (inset, right) and dies within a week. But when the gene is programmed to turn off just in 
skin cells, this conditional knockout mouse is born, but has very little hair (above). 


The new gene encodes information that tells it at 
which point in development or in which tissue to 
disappear. The strategy is especially important 
for determining the function of essential genes, 
which, if shut off too early, can kill a mouse 
while it’s still an embryo, short-circuiting studies 
of the gene’s effects. 

When KOMP knocks out a gene, however, 
it’s dead from day one. More embryos may die 
than with conditional knockout technology, but 
these “frank null” knockouts are still very 
informative, says Fletcher. They tell researchers 
whether a gene is necessary for development. 


list is figuring out how to avoid the knockout 
duplication already seen in the mouse commu- 
nity. That’s going to be a challenge, especially 
once each effort is cranking out hundreds of 
knockouts a month, often in random genes. 
EUCOMM’s Wurst admits it will be “hard to 
coordinate” his gene-trapping program with 
NorCOMM’s, because neither can predict 
which genes it’s going to knock out. And the 
American and European groups have yet to 
factor in the knockouts coming in from China. 
Even if redundancy can be addressed, it will 
still be caveat emptor for researchers who need 


to compare mice made by different projects. 
KOMP plans to use a strain of mouse called 
Black 6, whereas EUCOMM and NorCOMM 
are making their mutants in strain 129. That could 
cause studies of behavioral genes, for example, to 
yield skewed results. “Some 129 strains are 
really stupid, while Black 6 has a reputation for 
being smarter,” says Yale’s Picciotto. “You can’t 
compare the two.” 

Another unresolved issue is what to do 
about knockouts that are knockoffs of an 
already-patented mutant. Several biopharma- 
ceutical companies, including Deltagen in San 
Carlos, California, make their money selling 
big-ticket knockout mice. Deltagen, which last 
year earned $6.7 million from its catalog of 
900 knockouts, is seeking “broad patents” on 
the majority of its lines, says CEO Robert 
Driscoll. Driscoll would not comment on what 

steps, if any, the company would 
take if KOMP or another effort 
remade one of its patented mice. 
On the academic side, some 
researchers question the way the 
global endeavor is taking shape. 
“I’m not totally convinced [this 
effort] is going about things the 
right way,” says University Hospital 
of Zurich’s Aguzzi. He worries that 
the variety of strains and technolo- 
gies being used will lead to glitches 
in these high-throughput enter- 
prises. The global effort is “layers 
of magnitude more complicated than 
the Human Genome Project,’ he warns. 

Aguzzi also emphasizes the need to take one 
step at a time. He argues that plenty of knock- 
outs have been made with specific biological 
questions in mind and that these questions 
should be answered first. “Putting so much 
effort into creating a bunch of lines that people 
may not be able to ask the right questions with 
may not be the best use of resources,” he says. 

Each effort will try to address this concern 
by growing a subset of its frozen lines into 
live mice and then characterizing them. This 
information will then be uploaded into the 
central database, so researchers such as 
Picciotto might find their dream mouse. But a 
massive phenotyping effort is still years 
away—the next big step after this big step. 

Despite these caveats, the global project 
should have a dramatic impact on both basic and 
biomedical research, says Picciotto. “Ordering a 
mouse is never going to be as easy as ordering an 
antibody,” she says. But as the global project 
matures and begins to characterize the knockout 
lines in its libraries, even researchers in small 
labs and those who are not mouse geneticists will 
be able to delve into the world of the knockout 
mouse. “Before, scientists were limited by their 
experience and their resources,” she says. “Now 
they’ll only be limited by their imagination.” 

—DAVID GRIMM 
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Long-Ago Peoples May Have 
Been Long in the Tooth 


Life may have been nasty and 
brutish for our prehistoric fore- 
bears, but it wasn’t necessarily 
short. Contrary to the notion that it 
was rare for someone to reach the 
age of 40 in prehistoric societies, 
studies of modern hunter-gatherer 
groups suggest that a substantial 
proportion of our ancestors sur- 
vived into their 70s, says an anthro- 
pologist who has been studying 
indigenous people in Bolivia. 

Speaking last week at the meet- 
ing of the Human Behavior and 
Evolution Society in Philadelphia, 
Pennsylvania, Michael Gurven of 
the University of California, Santa 
Barbara, reported on mortality data collected 
from 10 modern-day groups of hunter-gatherers 
and forager-horticulturists, including the 
Tsimane Indians in north-central Bolivia, which 
he and Hillard Kaplan of the University of New 
Mexico have studied since 2001. 

Gurven’s analysis revealed that 40% to 
50% of the members of these groups never 
make it to age 15. But their prospects 
brighten after that, he says: A 15-year-old has 
a 40% chance of reaching 65, and by the time 
they reach 70, the mortality rate is no higher 
than for a U.S. resident. Evolutionary psy- 
chologist Daniel J. Kruger of the University 
of Michigan, Ann Arbor, says the new work 
“challenges current thought on ... the shape 
of the hominid survival curve.” Gurven and 
other scientists estimate ages and mortality 
of contemporary indigenous groups through 
a variety of convergent techniques, including 
interviews, old missionary records, historical 
events, and photographs. In contrast, recon- 
structions of prehistoric populations rely pri- 
marily on skeletal data. 

“Some reconstructions of prehistoric popu- 
lations tend to show life expectancies of 15 to 
25 years,” Gurven says, “with relatively low 
infant and child mortality but extremely high 
adult mortality.” Not only is Gurven’s work at 
odds with that scenario, but he says that “adult 
life expectancy is remarkably similar across 
these groups.” 

The research points to an existence struc- 
tured around longevity. “Adult-level produc- 
tion is controlled by skills and knowledge 
rather than physical restraints,” says Gurven. 
Although the men in these groups reach the 
height of their physical powers in their 20s, it 
is not until their 40s that they reach the peak of 


in Bolivia. 


Age old patterm a 
Tsimane senior citizen 
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hunting prowess, he notes. Rice production by 
males peaks in their 50s, as they turn from 
hunting to less rigorous agricultural pursuits. 
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And there is a pattern of resource flow from 
old to young that continues into the 70s. 

Kruger says the research indicates that “once 
individuals pass the concentrated risk in infancy 
and early childhood, they are more likely to 
survive into the older age range (65—75) than 
previously thought.” University of Michigan, 
Ann Arbor, anthropologist Rachel Caspari, who 
studies ancient populations, says that it’s not 
possible to know whether prehistoric people 
were living as long as modern hunter-gatherers 
without “high-resolution” ways of determining 
ages of death from fossils. However, she says 
Gurven’s results make sense and that her 
research supports the notion of a “modern 
human demographic pattern” that probably has 
pertained for the past 30,000 years. 

Gurven reported that the leading causes of 
death among those middle-aged and older 
appear to be from infections and violence. But 
these indigenous groups share some of the 
same scourges of aging that affect modern 
societies. By the age of 65, he says, “almost all 
have significant pain” from orthopedic prob- 
lems, especially back pain. 


An Evolutionary Squeeze on Brain Size 


Despite the huge individual differences in 
mental abilities, the size of the human brain 
varies remarkably little from person to person. 
In fact, brain volume is evolutionarily more 
stable than that of any other bodily organ, 
researchers reported last week. They speculate 
that our brain, after increasing dramatically in 
size in early human evolution, ran up against 
the skull-size limitations imposed by the 
female birth canal. 

Evolutionary psychologists Geoffrey Miller 
of the University of New Mexico, Albuquerque, 
and Lars Penke of Humboldt University in 
Berlin calculated for the human brain a measure 
called “coefficient of additive genetic variance,” 
or CVA. A formula involving the size range of a 
physical feature and its heritability, CVA 
reflects “evolvability”: that is, the extent to 
which the relevant genes are susceptible to 
change through mutation and natural selection. 

The heritability of brain is remarkably 
high—about 0.9, studies have shown. That 
makes brain size even more genetically 
influenced than height, according to Miller. 
Despite the brain’s complexity, he says, “at 
the genetic level, [brain size] seems as if 
it’s a really simple trait like fruit fly bristle 
number, ... astonishingly ironclad against 
any environmental perturbation.” 

Miller notes that he and Penke were sur- 
prised at the relatively low CVA for the human 
brain, because they assumed it had been subject 


to intense selection. CVAs are likely to be higher 
for traits that are fitness-related and therefore 
good candidates for natural selection. But the 
brain’s CVA of 7.8, low for a volumetric trait, 
means there is limited potential “evolvability.” 
Other features with substantial heritability, such 
as breasts and kneecaps, have much higher 
CVAs, said Miller. 

Certainly, there has been selection for brain 
volume in the past, said Miller: As hominids 
became more intelligent, their brain size tripled 
over a 2-million-year period to about 1400 cubic 
centimeters, compared with 370 for chimps. But 
its growth plateaued about 200,000 years ago, 
Miller speculates, when it “reached the physical 
constraint of pelvic size.” As a result, he says, 
brain size “is not a good index of IQ in recent 
evolution.” Adds Penke: “Virtually all theories of 
brain and intelligence evolution propose a recent 
history of directional [i.e., ‘more is better’ ] 
selection for both.” But “recent directional 
selection” on intelligence must have worked on 
brain features other than absolute size. 

That notion is corroborated by brain- 
imaging studies, says Richard Haier of the 
University of California, Irvine. Although 
the correlation between brain size and IQ is 
only a modest 0.4, he says, the latest imaging 
studies show much higher correlations of IQ 
with a “small number of discrete areas” of 
gray matter. 

—-CONSTANCE HOLDEN 
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CELL BIOLOGY 


Podosomes and Invadopodia Help 
Mobile Cells Step Lively 


These feetlike structures aid the necessary migrations of immune and other cells, but 


also the deadly wanderings of cancer cells 


Anyone who has taken an introductory biology 
class is familiar with the migratory prowess of 
the simple amoeba. But many of our own cells 
could give amoebae a run for their money. 
Immune cells have to sprint to infection sites to 
ward off invading pathogens. And cells that help 
maintain the skeleton nimbly patrol bones. On 
the minus side, however, cancer cells can travel 
throughout the body and seed new tumors. 

Recently, a couple of hitherto-obscure cellu- 
lar structures, known as podosomes and 
invadopodia, have come under increasing 
scrutiny as possible contributors to such peram- 
bulations. These microscopic assemblies, which 
are possibly related, both form on the bottom sur- 
face of the leading edge of migrating cells—a 
location consistent with the idea that they help 
cells move. The composition of each reinforces 
that suspicion: Both contain proteins, such as 
actin, that have previously been linked to cell 
motility (Science, 10 October 2003, p. 214), as 
well as enzymes that can break down the proteins 
of the extracellular matrix (ECM). 

Establishing the roles of podosomes and 
invadopodia hasn’t been easy, however, because 
they have so far been studied in cells main- 
tained on artificial surfaces rather than in live 
animals. “As far as I know, no one has seen 
these structures in vivo,” notes Gareth Jones of 
King’s College London. 

Still, recent research suggests that these cel- 
lular features are crucial for understanding and 
perhaps treating a variety of diseases as diverse 
as cancer and osteoporosis. Indeed, scientists 
have already implicated malfunctioning 
podosomes as a cause ofa rare hereditary disease 
that impairs the immune system. Research on 
podosomes and invadopodia “is heating up 
because they’ve really become physiologically 
important for things such as metastasis,” says 
cell biologist John Condeelis of Albert Einstein 
College of Medicine in New York City. 


Feet for cells 

Podosomes first appeared on cell biologists’ radar 
screens in the mid-1980s. Pier Carlo Marchisio, 
currently at San Raffaele Scientific Institute in 
Milan, Italy, and his colleagues discovered the 
structures on a variety of migratory cells, includ- 
ing immune cells called macrophages and osteo- 
clasts, which help maintain bone by dissolving 
away areas that need repair. In all cases, the struc- 
tures appeared where the cells made contact with 
a surface. This suggested that they might be 
involved in cell adhesion, and thus in cell motility, 


because cells have to stick to the surface over 
which they are migrating in order to move. 
Because the structures appeared to act like cellu- 
lar feet, the Marchisio team coined the name 
podosomes for them. 

Invadopodia came along just a few years 
later, identified in 1989 by Wen-Tien Chen, now 
at Stony Brook University in New York. He 
found that cells transformed by Rous sarcoma 
virus (RSV) form protrusions at their leading 
edge when moving on a surface containing 
ECM proteins such as fibronectin. More 
intriguing, Chen noticed that holes appeared in 
the protein substrate at the precise spots where 
the protrusions were located—an indication that 
they carry proteases that digest the ECM. 


Going mobile. In a moving macrophage (bottom), podosomes 
(stained red for actin and green for vinculin) congregate on 
the cell's leading edge while they ring the bottom of a quiescent 
macrophage (top, stained red for actin). 


“Invadopodia are more than just feet,” Chen 
says. “They have a functional effect.” The ability 
to digest ECM proteins would be very useful to 
migrating cells, including cancer cells that need 
to burrow through blood vessel walls in order to 
spread to distant sites. 

Marchisio and his colleagues found similar 
structures in cells transformed by RSV but con- 
sidered them to be podosomes. Both researchers 
now say that they think they were looking at the 
same thing, yet the precise relationship between 
podosomes and invadopodia remains a burning 
issue in the field. Most researchers today define 
podosomes as more dynamic than invadopodia 
and smaller—1 to 2 micrometers in diameter 
compared to 8 to 10 micrometers. But both struc- 
tures have a core of actin filaments and contain 
other proteins that regulate actin polymerization. 
Some researchers, including Roberto Buccione 
of the Consorzio Mario Negri Sud in Chieti, 
Italy, and Mark McNiven of the Mayo Clinic in 
Rochester, Minnesota, have suggested that 
podosomes form first and then mature into 
invadopodia, but the jury is still out on this idea. 


The buzz about WASp 

Research on podosomes took off in 
the late 1990s when they were linked 
to the then-newly-discovered mutant 
gene that causes Wiskott-Aldrich 
syndrome (WAS). Because the gene 
is on the X chromosome, the disease 
afflicts mainly boys, causing them to 
be severely immunodeficient. With- 
out a bone marrow transplant, the 
children usually die before age 20 of 
infections or of immune-cell cancers 
to which they are also prone. 

Early analysis of WASp, the pro- 
tein product of the gene that was 
found to be mutated in the immune 
syndrome, suggested that it helps 
control actin function. Then in the 
late 1990s, Stefan Linder of Ludwig 
Maximilians University in Munich, 
Germany, and his colleagues 
detected WASp in the actin core of 
macrophage podosomes. Linder’s 
group and another team led by Jones 
and Adrian Thrasher of University 
College London also found evidence 
that podosome formation requires 
WASp. “Normal macrophages and 
dendritic cells have these structures, 
but cells from boys with Wiskott- 
Aldrich had none at all. That was a bit 
of a shock,” Jones recalls. 

The London team found that the 
patients’ macrophages had also lost 
the ability to respond to chemokines 
and cytokines, the chemical signals 
that normally activate immune cells 
and attract them to infection sites— 
a possible explanation for the 
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impaired immunity of WAS patients. Both 
podosome formation and the ability to migrate in 
response to chemical signals were restored by 
putting the normal WAS gene into patient 
macrophages. “This was the first link [of WASp] 
to something physiological,” Linder says. 
Further evidence for a physiological link came 
from a study that Jones, Thrasher, and their col- 
leagues described in Blood in February of last 
year. Dendritic cells serve as sentinels throughout 
the body, detecting foreign invaders and initiating 
immune responses to them. The London team 
found that the dendritic cells of mice in which the 
WAS gene had been deactivated could not migrate 
to their normal locations in the lymph nodes, 
presumably because they can’t form podosomes. 


Cancer connection 

The ability to form podosomes and respond to 
chemokines could have a dark side: Cancer cells 
may exploit those same skills to spread. In 
recent work, Condeelis and his colleagues have 
been teasing out the changes in gene expression 
that characterize metastatic cancer cells. Having 
previously shown that the metastatic potential 
of cancer cells correlates with the cells’ ability 
to migrate toward epidermal growth factor 
(EGF), the researchers inserted tiny needles 
containing EGF into mammary tumors in 
rodents and then analyzed the gene expression 
patterns of the cells that moved into the needles. 
These cells had increased expression of 
numerous genes that promote motility and 
podosome/invadopodia formation, including 
WASp. Activation of the genes “determines the 
willingness and ability [of cancer cells] to move 
to a portal to escape the tumor,” Condeelis says. 

Other work by the Albert Einstein team indi- 
cates that macrophages and their podosomes are 
partners in the crime of metastasis. The 
researchers find that as mammary cancer cells 
migrate into the blood vessels, they move along 
with the immune cells. The two cell types pair up 
because each produces a growth factor that 
attracts the other: Macrophages associated with 
the blood vessels attract metastatic cancer cells 
by secreting EGE and the cancer cells emit a pro- 
tein called CSF-1 that attracts more macrophages 
and stimulates further EGF secretion by the 
immune cells. 

The CSF-1 made by cancer cells also 
induces in macrophages the expression of 
WASp and other genes involved in podosome 
formation; similarly, macrophage-made EGF 
fosters invadopodia formation by the cancer 
cells. Macrophages could further help tumor 
cells penetrate the blood vessel wall; recent 
work by the Condeelis team has shown that the 
immune cells’ podosomes, like invadopodia, 
carry enzymes that can digest the ECM. Chen 
suggests that inhibiting such enzymes, particu- 
larly one called seprase that his team has found 
in the invadopodia of melanoma cells, may 
thwart metastasis. 


In the 1 May issue of Cancer Research, 
Chen’s group reports that seprase and a similar 
enzyme, DPP4, are also located in a complex 
on invadopodialike structures in endothelial 
cells. Because tumor growth depends on the 
creation of new blood vessels by migrating 
endothelial cells, that finding hints at yet 
another anticancer strategy: Chen’s team 
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results, with some suggesting that the two 
structures are independent of each other. 
Jurdic, Destaing, and their colleagues have 
found, however, that sealing belts only form 
when osteoclasts are in contact with apatite, the 
mineral that forms the solid framework of bone. 
Apatite somehow activates both the Rho and Src 
kinase enzymes, both of which are needed for 


Bone repair. In an osteoclast, podosomes (stained red for paxillin and green for actin) form a belt on the cell 
periphery. The podosome belt may form the sealing ring that attaches an osteoclast (inset) to a bone mineral 
site that needs repair. 


blocked endothelial cell migration in lab dishes 
with monoclonal antibodies targeting the 
seprase-DPP4 complex. 


Boning up on podosomes 
The osteoclasts of bone are also migratory cells 
that may depend on podosomes for their 
mobility—and perhaps for much more. The 
cells seek out those areas of bone that need 
repair. Once there, an osteoclast settles down 
and forms a structure called a sealing ring, a 
tight connection between cell and bone. This 
protects adjacent bone from the osteoclasts. 
“For an osteoclast to be functional, it needs to 
isolate its target from the rest of bone; it secretes 
lots of concentrated acid,” says Olivier Destaing 
of Yale University School of Medicine. 
Podosomes appear to be involved in sealing- 
ring formation. Pierre Jurdic of the Ecole 
Normale Supérieure de Lyon in France and his 
colleagues, including Destaing, found that as 
osteoclasts develop, they first produce individ- 
ual podosomes that arrange themselves in 
small rings. These rings then expand to the 
periphery of the cell, where they form a stable 
podosome belt. The early view was that the 
podosome belts might coalesce to form sealing 
rings, which are thicker, but efforts to detect 
that transition had produced contradictory 


sealing-ring formation. The researchers also 
found that inhibiting Rho induces the transition 
of sealing rings into podosome belts. 

Further support for the idea that the 
structures are related comes from Benjamin 
Geiger’s group at the Weizmann Institute of 
Science in Rehovot, Israel. His team found 
rearrangements in podosome proteins—in 
particular, a marked increase in their levels— 
at locations where sealing-zone structures 
appeared to be forming from podosome rings. 
This would be consistent with the notion that 
sealing rings are thicker versions of podosome 
belts. Ifso, drugs that inhibit podosome function 
in osteoclasts could form the basis of a new 
osteoporosis therapy; the disorder typically 
results when the bone-dissolving activities of 
osteoclasts outpace the bone-forming capacities 
of their partners, the osteoblasts. 

Many questions about podosomes and 
invadopodia remain to be answered. In particular, 
researchers are searching for markers that would 
let them observe the structures in living tissue— 
and perhaps put to rest the nagging doubts 
about the relevance of lab-dish studies. That 
would certainly give them a step up on under- 
standing these cellular feet that are now attracting 
so much interest. 

—JEAN MARX 
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WATER QUALITY 


High Court Asks Army Corps 
To Measure Value of Wetlands 


A divided Supreme Court wants the government to adopt rules on which wetlands 
deserve federal protection, but scientists say they all matter 


Which wetlands are important enough to pro- 
tect? That’s the question the U.S. Supreme Court 
put to the U.S. Army Corps of Engineers last 
week when it ruled on two Michigan cases 
involving wetlands that property owners wanted 
to develop. The answer, which will require the 
corps to define more precisely its jurisdiction 
under the Clean Water Act, will have ramifica- 
tions for wetlands across the country. 

The 1972 act requires landowners to get a 
permit for certain actions that might harm wet- 
lands. The corps has claimed an expansive reach 
that covers any wetland from which water even- 
tually drains into larger lakes and rivers. But on 
19 June, in Rapanos vy. United States, the 
Supreme Court told the corps that’s not good 
enough. For wetlands that aren’t 
next to a navigable river, the key 
opinion said, the corps should 
draw up criteria to determine 
whether they provide significant 
enough benefits for clean water 
downstream to be regulated. “It 
is a critically important decision 
for our nation’s waters,” says 
ecologist Judy Meyer of the Uni- 
versity of Georgia, Athens. 

The diversity of wetlands 
makes that task a tough chal- 
lenge. Meyer and other scien- 
tists fear that more-complicated 
regulations could translate into 
less wetlands protection, as 
developers could lobby for 
some kinds of wetlands to be 
left out of the rule. In the mean- 
time, lawyers are likely to have 
a field day as the corps strug- 
gles to interpret the Supreme Court’s decision 
on a case-by-case basis. 

The lead plaintiff in the Michigan cases, 
developer John Rapanos of Midland, argued that 
the corps had no jurisdiction because the wet- 
lands on the contested property were 32 kilo- 
meters from navigable waters and connected to 
them by a mere ditch. Four justices rejected the 
corps’ argument that it can regulate wetlands 
adjacent to any tributary of navigable water, 
with Justice Antonin Scalia writing that the 
corps could only regulate wetlands with a con- 
tinuous surface connection to “relatively perma- 
nent bodies of water.” Four other justices took 
the opposite view, arguing that the corps’ exist- 
ing jurisdiction is reasonable. 

That tie set the stage for Justice Anthony 


Kennedy’s decisive vote. He agreed with Scalia 
that the two cases should go back to the lower 
courts for further consideration, but he said the 
corps should have jurisdiction over any wetlands 
that provide a significant benefit to the ““chemi- 
cal, physical, and biological integrity” of down- 
stream waters. Those whose contributions are 
“speculative or insubstantial,” he wrote, should 
be outside the corps’ purview. 

The corps has aimed for rules that are rela- 
tively simple and easy to interpret. To decide 
whether a particular locale falls under its juris- 
diction, for example, it has relied upon aerial 
photographs or topographic maps showing how 
surface water moves from that wetland to navi- 
gable waters. Likewise, the new rule will need 


No limits. Scientists say a new rule should cover even small wetlands, like this Michigan 
fen, that excel at cleansing water. 


to be “easily understandable for the corps ana- 
lysts and the permittees,” says Richard 
Ambrose, a wetland ecologist at the University 
of California, Los Angeles. “If you make it too 
complicated, it will be paralyzing.” 

But wetlands—and their functions—resist 
easy categorization. Their impact on water 
quality varies by location and other factors; 
wetlands near an agricultural field, for example, 
will likely process more nitrate and retain more 
sediment than wetlands near a pristine forest. 
And relatively undisturbed wetlands are likely 
to increase biological diversity in downstream 
waters, a factor the Clean Water Act is meant to 
protect. Water levels and flows also vary enor- 
mously, ranging from drenched cypress 
swamps to ephemeral vernal pools. Wetlands in 


the arid west pose a particular conundrum: 
When water flows only irregularly, what met- 
rics should be used? 

Scientists say rough indicators exist to deter- 
mine whether a wetland is having a positive 
impact on water quality. Joy Zedler of the Uni- 
versity of Wisconsin, Madison, suggests look- 
ing at water birds and other aquatic wildlife, the 
potential to trap sediment and reduce floods, and 
other factors that can be readily seen or inferred. 
But those indicators don’t answer what Justice 
Kennedy most wants to know: Which wetlands 
have enough of an impact on the integrity of 
waters to qualify for protection? “It is difficult to 
draw a bright line that works in the practical 
world of regulation,” says wetlands scientist 
Barbara Bedford of Cornell University. 

Even tiny wetlands, like those in head- 
waters and along small streams, can have a 
large cumulative impact, studies have 
shown. A weak current is better than larger 
streams at trapping silt that would otherwise 
degrade habitat for salmon and other fish. A 
slow flow also means that microbes have more 
time to convert excess fertilizer and prevent 
downstream algal blooms. In 
an experimental study pub- 
lished last September in the 
Journal of Environmental 
Quality, Stefanie Whitmire and 
Stephen Hamilton of Michigan 
State University, Hickory Cor- 
ners, showed that small wet- 
lands in southwestern Michi- 
gan were responsible for half 
of the nitrate removal in the 
watershed. These benefits 
diminish when the wetlands are 
degraded, scientists say. 

The corps says it’s reviewing 
the decision and declined further 
comment. Observers expect the 
agency to start work right away 
on interim guidance, although it 
could take a year or longer to 
issue a final rule. Until a rule is 
in place, the courts will proceed 
case by case. And so will the corps, which each 
year reviews about 85,000 requests for permits. 

Reed Hopper of the Pacific Legal Founda- 
tion, which represented Rapanos, has already 
claimed victory. “The court rejected the idea 
that there are no limits on the federal govern- 
ment’s regulatory authority under the Clean 
Water Act,” he said in a statement. “It is not the 
role of the federal government to micromanage 
every pond, puddle, and ditch in our country.” 
But environmentalists say that a seat-of-the- 
pants approach offers great potential for 
mischief. “It’s an invitation to development 
interests to contest the corps’ authority over 
wetlands,” says Jason Rylander of Defenders 
of Wildlife in Washington, D.C. 

—ERIK STOKSTAD 
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Response 
RAHMSTORF CRITICIZES OUR PREVIOUS CON- 
clusions about the climate reconstruction 
method of Mann et al. (1) (MBH98). In our 
previous analyses (2, 3), we found that MBH98 
underestimates past temperature variations 
when tested in climate simulations of the 
past few centuries. Rahmstorf argues that 
the simulated Northern 
Hemisphere temperature 
lies outside the uncer- 
tainty bounds of the 
pseudoreconstructions in 
the simulation with the 
model ECHO.-G, but inside 
CLS the uncertainty bounds in 
as the HadCM3 simulation. 
\ pce 2001 He concludes that our 
analysis supports the Mann 
et al. (1) reconstructions. 
This conclusion is wrong. 
The problem is the deter- 
mination of the error 
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edited by Etta Kavanagh 


Testing Climate Reconstructions 


HadCM3 climate model 


A 2005 U.S. CONGRESSIONAL ENQUIRY (1) FOCUSED ON THE VALIDITY 
of the climate reconstruction of the past millennium by Mann et al. (2) 
and referred to a Science Report that challenged the reconstruction 
method (“Reconstructing past climate from noisy data,” H. von Storch 
et al., 22 Oct. 2004, p. 679; published online 30 Sept. 2004). This 
Report was also discussed in the U.S. Senate in 2005 (3). In this discus- 
sion, it has been overlooked that von Storch eg al.’s Supporting Online 
Material (SOM) in fact supports the Mann et al. reconstruction. 

von Storch et al. presented tests of the climate proxy method with 
two climate models: the HadCM3 model (shown only in the SOM) 
and the ECHO-G model. Both are compared in the figure. The 
HadCM3 simulation (solid blue) is consistent with the climate proxy 
data reconstruction (grey band). The ECHO-G model has since been 
found to be afflicted by a major artificial climate drift due to an 


| | 
soe 
BN ON 


o 
N 


Oo 


Temperature deviation °C 


| | | | 
SF ES 1S. 5 
oan N 


-1 
1750 


1800 1850 1900 1950 2000 


Calendar year 


undocumented, inappropriate initialization procedure (4). 

The error of simulated proxies (dotted blue) found in the 
HadCM3 model is smaller than the error margin given by Mann et 
al. for their method and shown in the IPCC report (5). For the time 
period common to both models, the RMS error of the simulated 
proxies is 0.24°C in ECHO-G, but only 0.07°C in HadCM3—less 
than one-third. 

The two models thus give rather different, conflicting results 
about the potential errors of proxy reconstructions. This is not 
mentioned in the Report, which merely states, “Similar results are 
obtained with a simulation with the third Hadley Centre coupled 
model (HadCM3), demonstrating that the results obtained here are 
not dependent on the particular climate characteristics of the ECHO- 


Test of proxy climate reconstruction method 
with two climate models, HadCM3 and ECHO-G. 
Solid lines show Northern Hemisphere temperature 
in the models (31-year running means); the dotted 
lines show simulated proxy reconstructions where 
the proxies are degraded with 75% noise. The error 
of the proxy method is the difference between the 
solid and dotted lines (arrows). For comparison, we 
show the Mann et al. 40-year-smoothed reconstruc- 
tion for the Northern Hemisphere temperature 
(black) with its 95% confidence interval (grey), as 
shown in the IPCC Third Assessment Report (5). 


bounds. 

To successfully com- 
pute uncertainty bounds 
requires an error model. 
Updated uncertainty bounds 
for the MBH98 series, on 
40-year time scales, can 
be found in fig. 1B of 
Gerber et al. (4). Mann 
was a co-author on this 
study, and these uncer- 
tainties are consistent with 
the ones derived in our 


G simulation” (p. 680). 

In addition, it has since been found (6) that the proxy method was implemented incorrectly by 
von Storch et al.; with correct implementation, the error is even smaller in HadCM3 than the 
0.07°C shown here. A similar, more recent test with the NCAR climate system model (7) also 
suggests only small errors for the proxy method, supporting the climate reconstruction of the past 
millennium by Mann et al. 


analysis (3). Further, they are about a factor of 
3 smaller than those published two years earlier 
in the IPCC Third Assessment Report (5) and 
used in Rahmstorf’s Letter (20 of roughly 0.07 
K rather than 0.25 K for circa 1800). The result 
of the pseudoreconstruction and the target tem- 
perature in the HadCM3 model are therefore 
statistically well separated when using the 
proper uncertainties (3). 

We think that the Letter [as does (5)] illus- 
trates a common confusion in our field. There 
are two sources of uncertainty in reconstruct- 
ing past climate from proxy records: (i) cali- 
bration uncertainty—which part of the signal 
is not captured by the statistical method; and 
(ii) residual uncertainty—how much addi- 
tional, unrelated variability is engraved in the 
proxy records. Our most recent comment (3) 
did not make this point explicitly, but its uncer- 


STEFAN RAHMSTORF 
Potsdam Institute for Climate Impact Research, Box 601203, 14412 Potsdam, Germany. 
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tainty estimates are based on calibration error. 
We showed that the MBH98 method imple- 
mented in the simulations leads to pseudore- 
constructed temperatures being too warm and 
with differences from the target temperature 
larger than our calibration uncertainty ranges. 

Rahmstorf also alludes to a climate drift in 
the ECHO-G simulation (2). However, this drift 
mostly affects the earlier centuries of the millen- 
nium, when the pseudoreconstruction performs 
better, and is probably minor after 1400 A.D., 
when the pseudoreconstruction performs worse. 
For instance, ECHO-G simulates a difference in 
the Northern Hemisphere temperature between 
1900 and 1980 (the calibration period) and the 
Late Maunder Minimum (around 1700) of 0.97 
K, whereas a simulation with the CSM climate 
model from NCAR yields 0.87 K (6). Therefore, 
this issue cannot explain the bias of the recon- 
struction method. 

In conclusion, we feel that the paleorecon- 
struction community would be well served if it 
used error models describing uncertainties 
from both calibration and “noise,” which leads 
to uncertainties that have complex, possibly 
intermittent nonstationary behavior on differ- 
ent time scales. We also urge the community to 
test methods using realistic “pseudo-proxies” 
as they offer a good laboratory. 

HAUS VON STORCH,* EDUARDO ZORITA,* 

JULIE JONES,* FIDEL GONZALEZ-ROUCO,2 

SIMON TETT? 

‘nstitute for Coastal Research, GKSS Research Centre, 
Geesthacht 21502, Germany. Department of Astrophysics 
and Atmospheric Physics, Universidad Complutense de 


Madrid, Madrid 28040, Spain. 3UK Meteorological Office, 
Hadley Centre, Reading RG6 6BB, UK. 
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Team Science and 
the NIBIB 


| WOULD LIKE TO EXPLORE ISSUES RELATING TO 
the funding of biomedical engineering and 
imaging at the National Institute of Biomedical 
Imaging and Bioengineering (NIBIB) that are 
raised in an article on the Whitaker Foundation 
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(“Spending itself out of existence, Whitaker 
brings a field to life,’ D. Grimm, News Focus, 
3 Feb., p. 600). 

Fiscal year 2005 was the NIBIB’s third full 
year with an operating budget and its third year 
with a double-digit percentage annual growth 
in grant applications received. Even in the face 
of this large growth, the NIBIB’s budget pro- 
jections and management plan resulted in pay- 
ing to the 20th percentile, well within the range 
of paylines for the more established institutes. 
In addition, nearly all of the funded applica- 
tions contained bioengineering, even those 
internally labeled as “imaging.” Because this 
type of science is fundamentally interdiscipli- 
nary, it is difficult to accurately describe the 
relative support of biomedical imaging and 
bioengineering, reflecting progress toward 
achieving the goal of team science. 

In addition to the pervasive bioengineering 
content in our research portfolio, the institute has 
multiple training programs that target bioengi- 
neering, interdisciplinary science, and young 
investigators. Indeed, the majority of the NIBIB’s 
current training budget supports such programs. 
Of note, the NIBIB seeks to significantly en- 
hance the success of new investigators through its 
policy that increases the payline by 5 percentile 
points for first-time investigators. During the last 
fiscal year, this policy resulted in 33% more 
funded first-time NIBIB investigators. 

It is the team science approach, inclusive of 
young investigators, that is critical for realizing 
our vision of profoundly improving health care 
through technological innovation. 

RODERIC |. PETTIGREW 
Director, National Institute of Biomedical Imaging and 
Bioengineering, National Institutes of Health/U.S. 


Department of Health and Human Services, Bethesda, MD 
20892, USA. 


Funding for Young 
Investigators at Whitaker 


WE WISH TO ADD SOME ADDITIONAL INFORMA- 
tion to the points made in the News Focus arti- 
cle “Spending itself out of existence, Whitaker 
brings a field to life” (D. Grimm, 3 Feb., p. 
600). First, it should be noted that nonimag- 
ing-related biomedical engineering research 
was relatively underfunded, not that most of 
the funding was supporting clinical imaging 
research. For example, the 2004 numbers indi- 
cated that nonimaging research projects con- 
stituted less than 40% of the funded individual 
investigator-initiated grants. Second, the total 
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research support for biomedical imaging and 
bioengineering provided by the NIBIB is in- 
sufficient to meet the large demands spawned 
by the Whitaker Foundation in this exploding 
field. Not only is the NIBIB the second small- 
est institute, but because of its establishment 
in 2002, it did not benefit from the doubling of 
the NIH budget that all other institutes previ- 
ously enjoyed. Finally, the comment about 
“not stepping up to the plate” by Nerem 
referred only to the issue of funding new 
investigators, which was the specific focus of 
the young investigator award program of the 
Whitaker Foundation. For the United States to 
remain the leader in this highly competitive 
field, it is important that the NIBIB step in 
with its own young investigator program to 
support the bright young people entering the 
world of biomedical imaging and bioengi- 
neering. It is equally important that Congress 
contribute by providing increased funding 
for the NIBIB so that the citizens of the 
United States can reap the benefits of this 
new area of science and technology. 

ROBERT NEREM? AND FRANK YIN2 
‘nstitute for Bioengineering and Bioscience, Georgia 
Institute of Technology, Atlanta, GA 30332-0363, USA. 


2Department of Biomedical Engineering, Washington 
University, St. Louis, MO 63130, USA. 


Caspase-10 in Mouse 
or Not? 


THE FAMILY OF ASPARTATE-SPECIFIC AND CYS- 
teine-dependent proteases, called caspases, is 
crucial not only for apoptosis but also for dif- 
ferentiation and cell cycle progression. Several 
research groups have recently published data 
ruling in or out the participation of caspase-10, 
an initiator caspase functioning at the apex of 
death receptor signaling, in diverse apoptotic 
processes and in neuroblastoma metastasis 
(/—5). Curiously, these conclusions were 
reached from experiments performed in mouse 
cells, a species that, according to all public 
databases and a recent study by Reed and col- 
leagues (6), does not contain the CASP-10 
gene in its genome. 

How is it possible that these reports still 
assume the presence of caspase-10 in the 
mouse? An explanation might be the use of 
inadequate tools to study processing and acti- 
vation of caspase-10, such as antibodies and 
so-called caspase-specific inhibitory peptides. 
Indeed, all of the above studies used either 
unspecified caspase-10 antibodies or antibod- 
ies that are certified in their respective compa- 
nies’ data sheets to react with cellular extracts 
of human, mouse, and rat origin. The CASP-10 
gene is also absent from the rat genome. 
Indeed, a search for commercially available 
caspase-10 antibodies found that 19 out of 44 
caspase-10 antibodies that are distributed by 
24 companies are specified on their data sheets 


to react with mouse or rat tissues. Another 
seven antibodies were only tested with human 
cells, and the specificity of at least six other 
caspase-10 antibodies that supposedly react 
only with extracts from human cells could be 
questioned, as they were generated with the 
same immunogenic peptide as were antibodies 
that cross-react with mouse and rat tissues. 

In addition, many groups based their con- 
clusions about the presence of caspase-10 in 
the mouse on the utilization of the alleged cas- 
pase-10—specific AEVD peptide. However, 
most caspases including caspase-3 and -8 dis- 
play an even higher affinity for this substrate 
(K, of 42 and 1.6 nM, respectively) than cas- 
pase-10 itself (K,, 320 nM) (7). 

Although the responsibility for this mis- 
conception lies clearly in the hands of the indi- 
vidual researchers (and maybe also with the 
reviewers of the manuscripts), the various 
companies claiming specificity and applicabil- 
ity of their antibodies in mouse systems are 
also responsible. 

REINER U. JANICKE,* DENNIS SOHN, 
GUDRUN TOTZKE, KLAUS SCHULZE-OSTHOFF 


Institute of Molecular Medicine, University of Diisseldorf, 
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Role of iNOS in Human 
Host Defense 


IN 2001, S. THOMA-USZYNSKI ETAL. WROTE IN 
Science, “in humans the TLR-activated antimi- 
crobial pathway is NO [nitric oxide]—inde- 
pendent” (/). In the Report “Toll-like receptor 
triggering of a vitamin D—mediated human 
antimicrobial response” (P. T. Liu ef al., 24 
Mar., p. 1770; published online on 23 Feb.), 
authors from the same laboratories expanded 
this view to assert that “antimicrobial activity 
against intracellular bacteria ... in murine, but 
not human, monocytes and macrophages is 
mediated principally by nitric oxide” and that 
this establishes “the evolution of divergent 
antimicrobial pathways in mice ... versus 
humans. ...” The conclusion that humans lack 
this nitric oxide defense pathway in mononu- 
clear phagocytes is based on in vitro findings 
that differ in a critical respect from observa- 
tions of human macrophages in vivo and ex 
vivo. Thoma-Uszynski et al. (/) and Liu et al. 
cultured human monocytes under conditions 
that result in little or no expression of inducible 
NO synthase (iNOS). In contrast, in vivo and 
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ex vivo, human macrophages do express INOS 
in people with infectious and inflammatory 
diseases (2, 3), notably in tuberculosis. 

The role of iNOS in human host defense 
remains unresolved. The experiments (4—6) 
that established the role of iNOS in host 
defense in mice cannot be performed in 
people. However, when macrophages express- 
ing iNOS were recovered from patients and 
infected with mycobacteria in vitro, iNOS 
inhibitors abolished the macrophages’ antimy- 
cobacterial activity (7). 

When a cell type consistently expresses an 
enzyme in vivo, but differentiates in vitro so 
that the enzyme is lacking, the cell culture sys- 
tem must be considered deficient as a model. 
Results in a nonphysiologic cell culture system 
are not a sound basis for declaring that human 
evolution has branched off to abandon the use 
of NO in host defense. 

The shortcomings of in vitro systems for 
human macrophage differentiation are frus- 
trating, particularly because it is so difficult to 
access human macrophages that have under- 
gone full differentiation and immunologic acti- 
vation in vivo. Nonetheless, a cell culture 
model is useful only to the extent that it reflects 
the biology of the organism. At present, scien- 
tists lack the ability to induce iNOS consis- 
tently in human macrophages derived in vitro 
from the monocytes of healthy donors. In vitro 
studies that truly assess the role of iNOS in 
human host defense await the development of 
techniques for iNOS induction in cultured 
human macrophages that can match that in 
macrophages of people with disease. 

CARL NATHAN 
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Response 

A KEY POINT IN OUR RECENT PAPER WAS ONLY 
partially quoted by Nathan. The complete 
quote follows: “In innate immune responses, 
activation of Toll-like receptors (TLRs) trig- 
gers direct antimicrobial activity against intra- 
cellular bacteria, which in murine, but not 
human, monocytes and macrophages is medi- 
ated principally by nitric oxide.” Our data do 
establish mechanisms by which activation of 
the innate immune system via TLRs leads to 
antimicrobial activity, and in this context, it is 
not unreasonable to suggest “the evolution of 
divergent antimicrobial pathways in mice 
(nocturnal animals that use nitric oxide) versus 
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humans (daytime creatures that synthesize 
vitamin D, in the skin on exposure to UV 
light).” We were careful to state in the paper 
that, “We do not imply that this is the only 
antimicrobial mechanism available to human 
macrophages.” Nathan’s research on nitric 
oxide in acquired immunity is clearly impor- 
tant and is cited in our paper. We look forward 
to learning the mechanism by which nitric 
oxide is activated in human macrophages and 
contributes to immunity to tuberculosis. 
PHILIP LIU,* JOHN S. ADAMS,” 
ROBERT L. MODLIN? 
1Department of Microbiology, Immunology, and Molecular 
Genetics, and Division of Dermatology, Department of 
Medicine, David Geffen School of Medicine, University of 
California at Los Angeles, Los Angeles, CA 90095, USA. 


2Department of Medicine, Division of Endocrinology, 
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TECHNICAL COMMENT ABSTRACTS 


Comment ON “Ancient DNA from 
the First European Farmers in 7500- 
Year-Old Neolithic Sites” 


Albert J. Ammerman, Ron Pinhasi, 
Eszter Banffy 


On the basis of analysis of ancient DNA from early European 
farmers, Haak et al. (Reports, 11 November 2005, p. 1016) 
argued for the Paleolithic ancestry of modern Europeans. 
We stress that the study is more limited in scope than the 
authors claim, in part because not all of the skeletal samples 
date to the time of the Neolithic transition in a given area of 
Europe. 

Full text at www.sciencemag.org/cgi/content/full/312/5782/ 
1875a 


RESPONSE TO COMMENT ON “Ancient 
DNA from the First European Farmers 
in 7500-Year-Old Neolithic Sites” 


Joachim Burger, Detlef Gronenborn, 
Peter Forster, Shuichi Matsumura, 
Barbara Bramanti, Wolfgang Haak 


The discovery of mitochondrial type N1a in Central Euro- 
pean Neolithic skeletons at a high frequency enabled us to 
answer the question of whether the modern population is 
maternally descended from the early farmers, instead of 
addressing the traditional question of the origin of early 
European farmers. 

Full text at www.sciencemag.org/cgi/content/full312/5782/ 
1875b 


Letters to the Editor 


Letters (~300 words) discuss material published 
in Science in the previous 6 months or issues of 

general interest. They can be submitted through 
the Web (www.submit2science.org) or by regular 


mail (1200 New York Ave., NW, Washington, DC 
20005, USA). Letters are not acknowledged upon 
receipt, nor are authors generally consulted before 
publication. Whether published in full or in part, 
letters are subject to editing for clarity and space. 
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POLITICAL PSYCHOLOGY 


The Perils of Prognostication 


John T. Jost 


e are all dilettantes in many, if not 
W most, areas of life and learning. 
When we ponder possible futures 


and appropriate courses of action and we 
encounter the limits of our own understanding, 
what can we do but turn to the experts on mat- 
ters ranging from the weather and the stock 
market to the health of our bodies and our 
nations and so much in between? We realize (at 
least sometimes) that we don’t know what the 
future holds, but at least the experts have a 
pretty good idea. Don’t they? 

For anyone who gains solace or inspiration 
from the conviction displayed by Sunday 
morning political pundits or “I told you so” 
Monday morning quarterbacks that populate 
every field, Philip Tetlock’s Expert Political 
Judgment will be sobering. The results of his 
painstaking research are complex, nuanced, 
and contingent, but the bottom line is clear 
enough. Tetlock’s data “plunk human forecast- 
ers into an unflattering spot along the perform- 
ance continuum, distressingly 
closer to the chimp than to the for- 
mal statistical models.” In fact, 
“it is impossible to find any domain 
in which humans clearly outper- 
formed crude extrapolation algo- 
rithms, less still sophisticated 
statistical ones” (emphasis in orig- 
inal). Worst of all, those experts 
with the poorest track records are 
the most likely to show up on TV 
screens and blogsites everywhere. 

Tetlock is a social psychologist 
by training, a political scientist by 
choice, and now a business school 
professor (at the University of 
California, Berkeley) by avoca- 
tion. For over 20 years, he has been 
a pioneer in the relatively young 
interdisciplinary field of political psychology. 
His wide-ranging, partially overlapping inter- 
ests in lay theories of epistemology and philos- 
ophy of science, cognitive styles, motivated 
reasoning, political ideology, domestic and 
foreign policy decision-making, counterfac- 
tual thinking, and accountability are all 
brought together in this, his most ambitious, 
profound, and integrative book to date. In 
many ways, it is a tour de force, providing as it 
does a vivid, sophisticated illustration of our 
limitations in forecasting and, at the same time, 


The reviewer is at the Department of Psychology, New York 
University, 6 Washington Place, 5th Floor, New York, NY 
10003, USA. E-mail: john.jost@nyu.edu 


the analytical power of our psychological tools 
when applied in retrospect. 

Tetlock asked 284 experts with advanced 
educational and professional 
training in international rela- 
tions, political science, law, 
economics, business, public 
policy, and journalism to make 
thousands of predictions be- 
tween 1988 and 2003. Par- 
ticipants rendered both short- 
term and long-term subjective 
probability estimates of hypo- 
thetical events that were inside 
and outside their domain of 
expertise, including the Persian 
Gulf War, the transition from 
Communism in Eastern bloc 
countries, the fall of apartheid in South Africa, 
the outcomes of U.S. presidential elections, the 
existence of weapons of mass destruction, and 
the bursting of the Internet bubble. These top- 


Judgment 


ics are so intriguing that one wants to see 
detailed information concerning their predic- 
tions on a case-by-case basis. Unfortunately, 
Tetlock keeps the reader fairly removed from 
the raw, unprocessed data and offers instead 
more abstract generalizations concerning the 
characteristics of better and worse judges. 

To cope with the mind-boggling complex- 
ity involved in processing over 80,000 expert 
predictions and distilling the concomitants of 
accuracy, Tetlock boils things down to a single 
dimension of cognitive style that captures most 
of the good judgment he could find. Drawing 
on an essay by Isaiah Berlin, Tetlock distin- 
guishes between “foxes,” who “‘know many 
little things,’ draw from an eclectic array of 
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by Philip E. Tetlock 


Princeton University Press, 
Princeton, NJ, 2006. 337 pp. 
$35, £22.95. ISBN 0-691- 
12302-2. Paper, $19.95, 
£12.95. ISBN 0-691-12871-5. 


traditions, and accept ambiguity and contra- 
diction as inevitable features of life” and 
“hedgehogs,” who “‘know one big thing,’ toil 
devotedly within one tradition, and reach for 
formulaic solutions to ill-defined problems.” 

How does Tetlock measure the location of 
each of his experts on the fox-hedgehog contin- 
uum? In the book’s Methodological Appendix, we 
learn that he used a factor analysis of responses 
to a “styles of reasoning” questionnaire com- 
prising 13 items. Eight items 
were drawn from the “need for 
cognitive closure” scale (7). One 
item provides respondents with 
Berlin’s definition and asks 
them to classify themselves as 
either foxes or hedgehogs. The 
remaining four items focus on 
relative preferences for simplic- 
ity, parsimony, predictability, 
and decisiveness (all of which 
are more appealing to hedge- 
hogs than to foxes). Tetlock’s 
hedgehog-fox score is based on 
the seven items that had the 
highest loadings (above 0.25) on the first factor. 

Much of the book details the ways in which 
foxes outperform hedgehogs as prognostica- 
tors and Bayesian updaters. Foxes scored 
higher than others on measures of 
calibration; their subjective proba- 
bility estimates were better corre- 
lated with the objective frequencies 
of the events they were predicting, 
especially in the short term. The 
worst judges were hedgehog 
extremists who made long-term 
predictions in their own areas of 
expertise. They correctly antici- 
pated war in the former Yugoslavia, 
but they also predicted several wars 
that did not happen. Even more 
than others, they frequently overes- 
timated the likelihood of drastic 
changes from the status quo. 

When unexpected outcomes 
occurred, hedgehogs were less 
likely than foxes to revise their 
beliefs in light of new realities. They were also 
more likely to display hindsight bias, believing 
that they “knew it all along,” even when they 
did not, and they were less charitable toward 
their competition, exaggerating the extent to 
which rivals were mistaken. The only advan- 
tage hedgehogs enjoyed—other than greater 
media exposure—was a tendency to swing for 
the home-run fences. They were almost twice 
as likely as foxes to declare certain events as 
either inevitable or impossible, and when they 
did so they were usually correct. 

There is an emerging (or perhaps reemerg- 
ing) scientific controversy concerning the 
nature of the relationship between political 
ideology and general psychological character- 
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istics. Much evidence—including results from 
Tetlock’s own research (2, 3)—indicates that 
personal needs for order, structure, and closure 
are positively associated with conservatism, 
whereas openness, tolerance for ambiguity, 
and integrative complexity are positively asso- 
ciated with liberalism (4). Others deny that 
these associations are important or consequen- 
tial (5). Given the strong overlap in how the 
fox-hedgehog dimension and these other vari- 
ables are measured, there is a missed opportu- 
nity to investigate in detail the effects of ideology 
(as mediated by cognitive style) on judgment 
and prediction. Tetlock suggests that his 
fox-hedgehog dimension is unrelated to the 
left-right ideological dimension, but he does 
not provide direct or sufficient information 
bearing on the nature of their association. 

Nevertheless, as Tetlock points out, the ide- 
ological range of his expert sample may not 
have been wide enough to adequately test the 
“rigidity-of-the-right” hypothesis. And in any 
case, it does seem likely that leftists would be 
better predictors in some domains and rightists 
in others. Given that people see the future (at 
least in part) as they would like it to be, an 
answer to the question of whether liberals or 
conservatives are more accurate in their pre- 
dictions depends upon, among other things, 
whether the world happens to turn to the left or 
right during the specified time period. 

Tetlock does illustrate, quite cleverly, the 
effects of ideology on perceptions of historical 
counterfactuals. Whereas conservatives were 
convinced that the Soviet Union would not 
have changed without external pressure from 
the West (and liberals were more optimistic 
about internally generated reform), liberals 
believed that apartheid would not have ended 
without Western sanctions (and conservatives 
found it more likely that change could have 
come purely from within South Africa). 
Taking a staple from classic social psychol- 
ogy, Tetlock concludes that, “The operative 
principle is dissonance reduction: the more we 
hate a regime, the more repugnant it becomes 
to attribute anything good to redemptive dis- 
positions of the regime (such as a capacity for 
self-correction).” 

The strength of the analysis presented in 
Expert Political Judgment lies in the author’s 
carving out the copious space between norma- 
tive standards of prediction captured by com- 
plex equations requiring difficult-to-calculate 
base rates and statistical modeling of stochas- 
tic processes, on one hand, and descriptive evi- 
dence concerning the actual predictions made 
not merely by ordinary human beings but by 
the most talented experts among us, on the 
other. To fill in that space, Tetlock adroitly 
wields a succession of theories and findings 
from social and cognitive psychology. In the 
process he advances considerably the impor- 
tant work begun by Daniel Kahneman and 


Amos Tversky (6), demonstrating just how 
desperately we need a scientific psychology of 
judgment and decision-making to correct for 
the many failings of a ruminating species. 
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SOCIOLOGY 
Fishing Rights and 
Race Relations 


Nicholas J. G. Winter 


ters of race in America has been fueled 

by an apparent contradiction. On the one 
hand, there has been a dramatic long-term 
trend among whites away from support for 
overt racism. On the other hand, support for 
policies intended to address racial inequities— 
such as school busing or affirmative action— 
has not increased. There is a disjunction 
between genuine support for the abstract prin- 
ciple of racial equality and ambiva- 
lence or opposition to policies that 
seem logically connected to that 
principle. 

The literature provides three 
basic explanations for this pattern. 
The first suggests that many 
whites oppose these policies not 
for racial motives but because they 
see such interventions as under- 
mining cherished American values 
such as individualism (J, 2). If so, 
the contradiction between support 
for racial equality and opposition 
to policies is only apparent, 
because that opposition is not ultimately racial. 
The other two approaches draw attention to the 
role of race, but in rather different ways. The 
realistic group conflict school suggests that 
white opposition to policies is based impor- 
tantly in material racial group interest: when 
policies hurt whites as a group, whites oppose 
them (3, 4). The symbolic racism school argues 
that although whites no longer generally 
endorse traditional anti-black stereotypes, a 
new form of racism has evolved that condemns 
blacks and other subordinate racial groups for 
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Mia Tuan 


The reviewer is at the Department of Politics, University 
of Virginia, Charlottesville, VA 22904, USA. E-mail: 
nwinter@virginia.edu 


Prejudice in Politics 
Group Position, Public 
Opinion, and the 
Wisconsin Treaty Rights 
Dispute 


by Lawrence D. Bobo and 
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their perceived failure to live up to American 
values like individualism and the work ethic 
(5, 6). This new symbolic racism represents a 
blending of anti-black feelings with these val- 
ues. There is long-running debate among pro- 
ponents of these approaches—and important 
variants of all three—that has turned on ques- 
tions of theory, measurement, and statistical 
methodology. 

Prejudice in Politics moves beyond the tra- 
ditional focus on black-white relations to 
explore these questions in the context of the 
controversy over Chippewa fishing and hunt- 
ing rights in northern Wisconsin. Litigation 
over the treaties that ensure these rights ran 
from the mid-1970s through the early 1990s 
and spawned an intense and extended period of 
racial conflict. Lawrence Bobo and Mia Tuan 
(sociologists at Stanford University and the 
University of Oregon, respectively) offer an 
important contribution to the racial attitudes 
literature. The book is also an excellent 
resource for a broader audience interested in 
the continuing role of race and racism in 
American society and politics. 

Drawing on sociologist Herbert Blumer’s 
classic work on race relations (7), Bobo and 
Tuan develop a “group position” model of 
racial attitudes that integrates aspects of both 
the group conflict and the symbolic racism 
approaches. They take seriously the role of 
individual-level prejudice, while also arguing 
convincingly that the social, historical, and 
political processes that create 
and institutionalize racial 
group differences are im- 
portant in determining racial 
attitudes. They argue that 
racial groups are more than 
“mere” groups: they are an 
important way that society 
allocates rights to “scarce 
and socially valued goods 
and resources.” The con- 
struction and maintenance 
of racial categories are im- 
portant ways that appro- 
priate roles, rewards, and 
outcomes are delineated in society. When that 
system is threatened symbolically—as in the 
treaty rights dispute—reactions by the domi- 
nant group can be strong and well out of pro- 
portion to the actual material threat, which 
was, in fact, minimal in this case. Bobo and 
Tuan’s compelling theoretical development is 
pleasingly accessible to those not versed in the 
literature on racial attitudes. At the same time 
the authors provide signals (and references) 
that allow specialists to place their approach in 
the context of the broader literature. 

The empirical heart of the book is based on 
a 1990 public opinion survey of Wisconsin res- 
idents, which allows Bobo and Tuan to docu- 
ment white opinion (8) on this serious racial 
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Prejudiced response. Efforts by the Chippewa to exercise their treaty rights to spearfishing in northern 


Wisconsin met with racially charged protests from whites. 


dispute. The data also allow them to develop 
measures of the key constructs from each of 
the three contending racial attitudes models. 
Their results compellingly demonstrate the 
failure of the nonracial values approach to 
explain white opinion. The authors present a 
series of statistical analyses that demonstrate 
the impact of racial predispositions on opinion, 
above and beyond individual demographic 
characteristics. They also make excellent and 
extensive use of respondents’ own words, from 
open-ended responses, to show the ways that 
white Wisconsin residents’ reactions to the 
Chippewa and to treaty rights are deeply and 
subtly inflected with racial considerations. 
These findings underline the conclusion that 
matters of race are still very much a part of 
white Americans’ political cognition. 

The data are less able to distinguish 
between the group position and the symbolic 
racism models. Both models deal with racial 
prejudice, with important—though subtle— 
differences in their understanding of preju- 
dice. Bobo and Tuan present compelling evi- 
dence that the empirical data are consistent 
with their group position model. As they 
acknowledge, however, the data are not 
incompatible with the symbolic politics 
model. I believe this is not a failure in their 
choice of this case study or in the develop- 
ment of their survey questions. Rather, the 
theoretical distinctions being drawn in the 
modern versions of these various explana- 
tions are fine enough that survey data are 
hard-pressed to distinguish among them. The 
measures of the building blocks of the mod- 
els—stereotyping, group competition, politi- 
cal or group threat, symbolic racism, group 
affect—are too highly correlated to allow a 
convincing winner to emerge from head-to- 
head statistical competition. This means that 
the authors’ ability to adjudicate between 
group position and symbolic racism models 


turns importantly on a subtle reading of the 
open-ended data. 

Despite this, Prejudice in Politics (along 
with work in the symbolic racism tradi- 
tion) has important lessons for our 
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Stoking the Voters’ 
Passions 


James N. Druckman 


stitution, The Federalist Papers make 

clear that input from citizens must be lim- 
ited because they think too emotionally. 
Federalist 49 states, “The danger of disturb- 
ing the public tranquility by interesting too 
strongly the public passions, is a still more 
serious objection against a frequent reference 
of constitutional questions to the decision of 
the whole society.” Further on, toward the end 
of the essay, the author (Alexander Hamilton 
or James Madison) concludes, “The passions, 
therefore, not the reason, of the public would 
sit in judgment. But it is the reason, alone, of 
the public, that ought to control and regulate 
the government. The passions ought to be 
controlled and regu- 
lated by the govern- 


I: its defense of the United States Con- 


understanding of American democracy 
broadly speaking. There is a long tradi- 
tion, dating back at least to Alexis de 
Tocqueville and Gunnar Myrdal, of see- 
ing white Americans’ opposition to the 
advancement of racial “others” as mere 
irrational prejudice, fundamentally 
unconnected to the true essence of 
American culture, society, and politics. 
Bobo and Tuan show that whites’ atti- 
tudes are to a considerable extent based 
on racial predispositions and that those 
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ment” (/). 

Over two centuries 
later, this view contin- 
ues to be the conven- 
tional wisdom for 
many. Social scien- 
tists, however, have 
offered little insight 
into the role of emo- 
tion in shaping citi- 
zens’ political deci- 
sions. Do emotions play 


predispositions represent far more than 
irrational individual dislike. Rather, 
Americans’ racial attitudes connect impor- 
tantly with the ways that racial categories are 
constructed and institutionalized in social 
structure and political conflict. In this sense, 
they are a fundamental—if distasteful—part of 
American society and culture. 


References and Notes 

1. P.M. Sniderman, T. L. Piazza, The Scar of Race (Harvard 
Univ. Press, Cambridge, MA, 1993). 

2. P.M. Sniderman, E. G. Carmines, Reaching Beyond Race 
(Harvard Univ. Press, Cambridge, MA, 1997). 

3. J. M. Glaser, J. Polit. 56, 21 (1994). 

4. L. Bobo, in Eliminating Racism: Profiles in Controversy, 
P.A. Katz, Ed. (Plenum, New York, 1988), pp. 85-116. 

5. D. 0. Sears, in Eliminating Racism: Profiles in 
Controversy, P. A. Katz, Ed. (Plenum, New York, 1988), 
pp. 53-84. 

6. D.R. Kinder, L. M. Sanders, Divided by Color: Racial 
Politics and Democratic Ideals (Univ. Chicago Press, 
Chicago, 1996). 

7. H. Blumer, Pac. Sociol. Rev. 1, 3 (1958). 

8. The authors report that they “were unable to sample 
opinions among any significant number of American 
Indians, including the Chippewa themselves.” 


10.1126/science.1129404 


a substantial role? If 
so, when? And is such 
a role problematic? With each technological 
innovation in the mass media that offers 
politicians new means to play on the public’s 
emotions, these questions become more 
pressing. It is such questions that frame the 
topics Ted Brader addresses in Campaigning 
for Hearts and Minds. 

Brader, an assistant professor at the 
University of Michigan, begins by noting the 
development of two recent but largely distinct 
research programs in political science. One 
focuses on how mass communication affects 
citizens’ opinions. Using content analyses, 
experiments, surveys, and case studies, social 
scientists from various disciplines have 
shown—not surprisingly—that what politi- 
cians and news sources say can shape what cit- 
izens think and believe. Another fairly recent 
body of work shows citizens’ actions and opin- 
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ions depend in fundamental ways on their emo- 
tions. For example George E. Marcus, W. 
Russell Neuman, and Michael MacKuen have 
shown that people tend to become more politi- 
cally involved and interested when they feel 
enthusiastic and tend to become more atten- 
tive, information-seeking, and open to attitude 
change when they feel anxious (2). 

Brader impressively brings these two pro- 
grams together to probe the age-old concern of 
whether politicians can manipulate emotional 
whims to their advantage. He specifically 
focuses on television ads, which, he explains, 
“have become the principal tool of contempo- 
rary electioneering” in the United States. After 
reviewing prior related work and assessing 
“conventional wisdoms” about advertisements 
and emotions, Brader offers a psychological 
theory of emotional appeals. 


candidate advertisement into the commercials. 
Thus participants randomly saw either no 
advertisement, one of two ads that had little 
emotional content, or one of those same two 
ads but with elements meant to stimulate 
enthusiasm or fear. (He randomly varied other 
aspects of the ads such as the sponsor of the 
ad.) Importantly, the elements he used to stim- 
ulate enthusiasm or fear have nothing to do 
with the ads’ contents; rather, he manipulated 
emotional stimulation entirely by including or 
excluding certain images and music. For 
example, the enthusiasm ad added uplifting 
music and brightly colored images of children 
playing and smiling, whereas the no-enthusiasm 
ad had no music and used distant, expression- 
less pictures. 

Although there are some unexpected results, 
the bottom line is that overall Brader’s evi- 


Less evocative 


BOOKS ETAL. 


ads, candidates aim to stimulate enthusiasm, 
fear, or some other emotion through the use of 
music, color, and other visual cues. The author 
also demonstrates that emotional ads do not 
generally lack logic, facts, or policy discus- 
sion. Emotion should not be equated with a 
lack of substance. 

Brader’s experiments, content analysis, and 
data presentations are careful and methodical. 
He explicitly addresses a number of complica- 
tions, and Campaigning for Hearts and Minds 
shows the methodological state of the art in 
political communication research. The book 
also reflects the current status of research on 
emotion in political science. And this raises an 
important question: Should political scientists 
delve beyond behavioral manifestations of 
emotion to explore processing? Advances in 
neuroscience make it possible to focus on 


Fearful 


nt 
Ce 


Cueing emotion. In Brader’s enthusiasm experiment, one ad contained hopeful images and music while the other relied on the narration and less evocative 
imagery. (Both ads used the same positive script, here: “There’s good news in your neighborhood. The future looks bright for a generation of young people.”) The 
fear experiment compared the effects of the same unevocative imagery with those of threatening images and dissonant music. (In the negative narration for this 
pair, “It’s happening right now in your neighborhood. A generation of young people is in danger.”) 


Brader extends Marcus and colleagues’ 
aforementioned theory of affective intelli- 
gence (2) and research in psychology by 
Richard Lazarus, Jeffrey Gray, David Watson, 
and others (3—5) by incorporating the effect of 
political communication (e.g., advertisements) 
on different emotions and, consequentially, 
attitudes and behaviors. For example, ads that 
generate enthusiasm will increase political 
interest, participation, and confidence, whereas 
fear-provoking ads will cause people to reeval- 
uate their preferences and potentially change 
their opinions. 

Perhaps the book’s major contribution is to 
then describe what types of advertisements 
stimulate enthusiasm or fear, test the impact of 
such advertisements, and demonstrate the rel- 
evance of these advertisements to ongoing 
political campaigns. The author presents 
results from a set of compelling experiments 
that he implemented during the 1998 Mass- 
achusetts gubernatorial primary. Brader recruited 
a diverse set of participants to watch a prere- 
corded segment of nightly news along with the 
accompanying commercials (in a comfortable 
setting that resembled a living room). He then 
randomly inserted different versions of a single 


dence strongly supports his expectations. 
Enthusiastic ads motivate individuals to par- 
ticipate (e.g., willingness to volunteer, inten- 
tion to vote), and once participating, these 
individuals are likely to become even more 
committed to their prior preferences. The 
implication is that enthusiasm leads to politi- 
cal polarization by pushing voters to take 
action on behalf of their prior convictions. 
Fear ads have less participatory power— 
although to some extent they motivate sophis- 
ticated individuals. But, fear can open the 
gates of persuasion, and these ads tend to 
cause individuals to consider new information 
and possibly change their political prefer- 
ences. These findings have important norma- 
tive implications. Even though citizens clearly 
act on emotions when prompted to do so, their 
actions need not result in “bad” outcomes. 
Indeed, possible consequences of the use of 
emotional ads include a more active (enthusi- 
astic) populace and citizens who seek new 
information (to temper their fear). 

Brader follows up his experimental find- 
ings with a comprehensive content analysis of 
over 1400 candidate ads from 1999-2000. He 
shows that in more than three-quarters of their 


much more precise processes. If social scien- 
tists opt to take this route, which certainly 
offers some benefits, it is crucial they do not 
neglect the political, economic, and social 
environments that define their disciplines. 
Indeed, another feature missing from most 
research into political communication is atten- 
tion to common elements of the political con- 
texts, such as competition among messages 
(e.g., advertisements) over time. One chal- 
lenge for the next generation of political com- 
munication research is to incorporate such 
political contexts while probing deeper into the 
underlying psychological mechanisms. 
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COMMUNITY COLLABORATIONS 


Collaborative Ecological 


Restoration 


Warren Gold,'* Kern Ewing,? John Banks,’ Martha Groom,’ Tom Hinckley, 
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Faculty and students at the University of Washington forge interdisciplinary partnerships with the 


regional community to restore damaged habitats. 


human communities and ecological 

sustainability demands that we super- 
sede our traditional, balkanized disciplines (/). 
The field of ecological restoration showcases 
the necessity and merits of interdisciplinary 
approaches to real-world problems. Drawing 
on ecology, other life sciences, physical and 
social sciences, and the humanities, ecological 
restoration has a long history (2). 

The dust bowl of the midwestern United 
States in the 1930s was rehabilitated with the 
benefit of diverse skills that were grounded in 
ecological science. Subsequent experience in 
treating degraded landscapes 
has developed the field of eco- 
logical restoration with a focus 
on returning biological poten- 
tial and ecological integrity to 
damaged land (3, 4) [support- 
ing online material (SOM) 
text]. Successful restoration re- 
quires interdisciplinary partici- 
pation from land managers, 
policy-makers, scientists, and 
educators (5, 6). 

In 1999, the University of 
Washington (UW) began linking 
components of restoration ecol- 
ogy studies across academic 
units among its three campuses (7). The UW 
Restoration Ecology Network (UW-REN) was 
created with one-time internal funds to catalyze 
faculty and student participation across the tradi- 
tional boundaries of academic departments (8). 

Students from natural and social sciences 
and humanities can earn an academic certifi- 
cate (similar to a minor) in Restoration 
Ecology on any of the three UW campuses. 
Students learn how knowledge from their dis- 
cipline applies to restoration. All students par- 
ticipate in a year-long ecological restoration 


T he complexity of the interface between 
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project. This capstone project engages students 
in interactive hands-on learning, revealing the 
complexity of real-world solutions and creat- 
ing bonds between the university and the pub- 
lic (see photograph, below). 


UW-REN Capstone Projects 

The restoration ecology capstone lasts for one 
academic year (fig. S1) and is directed by 
diverse faculty (9). Students, ideally represent- 
ing a range of scientific and humanities fields, 
team up for aquatic and terrestrial restoration 
projects in neighboring communities. Part- 
icipation is limited to senior students, who 


Retrieving degraded land. UW-REN capstone students restoring 
streamside habitat. 


have sufficient knowledge of their field to con- 
tribute to a multidisciplinary team. 

Projects are selected by a panel of UW-REN 
faculty that meets prospective clients, who 
might represent local governments, schools, 
utilities, foundations, or community groups 
(table S1). Usually, projects are proposed by 
clients who would otherwise be working inde- 
pendently or with private-sector consultants 
but lack the financial or technical resources to 
do so. Some clients do have the resources but 
prefer to work with our students. Projects are 
selected on the basis of their regional ecologi- 
cal importance and potential for community 
and client involvement. These projects are also 
chosen for their educational value in represent- 
ing a range of ecological and restoration chal- 
lenges, and they must be feasible in size and 
scope for a student team to handle in an aca- 
demic year. Project sites have generally been 
less than one acre, although larger project sites 


have been restored by multiple student teams 
working in sequential years. 


The Capstone Experience 

The first course, in the autumn quarter, intro- 
duces restoration tools through lectures, field 
visits, and demonstrations by restoration profes- 
sionals. Topics range from scientific (site assess- 
ment, bioengineering, invasive species, and eco- 
logical theory application) to pragmatic (project 
management, grants, and community relations). 
Previous projects are studied to judge how well 
original goals were achieved and to understand 
the sustainability of outcomes. Teams of four to 
six students make their initial visit to the project 
site. This quarter builds shared foundations of 
scholarly and practical knowledge and promotes 
interdisciplinary communication in the teams. 

The second course, in the winter quarter, 
begins with an ecological site analysis during 
which students collect the information neces- 
sary to design and implement their project. 
Student teams then draft a proposal for the 
client, responding to the client’s stated needs 
by describing proposed actions and the ecolog- 
ical possibilities of the site. The proposal 
undergoes peer review by students in other 
capstone teams and UW-REN faculty before it 
is sent to the client. Next, by discussing and 
clarifying objectives and budget constraints 
with the client, the team negotiates a final pro- 
posal. A detailed work plan is then created that 
is rigorously tied to client needs, site condi- 
tions, prospects for long-term stewardship, and 
the underlying science. Students draw on their 
academic backgrounds to connect site-specific 
project implementation to previous knowledge 
represented in the literature (see table). That 
each restoration site is unique complicates this 
task but also proves the utility of providing a 
process to formulate unique responses. 

In the spring quarter, teams finish site prepa- 
ration, control invasive species, install habitat 
elements (including plants), describe the base- 
line conditions after installation, and develop 
long-term maintenance strategies. Teams craft 
stewardship plans, train their clients in the plan, 
and galvanize community support to ensure 
long-term project success. All project docu- 
ments are accessible online. A poster session, 
attended by current and former clients, students, 
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and neighbors of restored sites, ends the class. 
This event supports links among the university 
and surrounding communities and gives stu- 
dents an opportunity to communicate their work. 


Context of the Capstone 

The capstone experience is intended for stu- 
dents with prior training in basic concepts of 
restoration ecology, which they may acquire 
through a junior-level prerequisite course that 
is open to students from any academic back- 
ground. Beyond the academic knowledge of 
ecology, skills in project management, com- 
munication, collaboration, and negotiation are 
honed on the way to project success. 

Through the course of a project, students 
encounter and learn to address a variety of 
pragmatic challenges that can be found in many 
restoration projects. A project may require can- 
vassing neighbors, finding and managing 
volunteers, preparing education materials, 
posting signs, attending neighborhood meet- 
ings, applying for permits and small grants, 
soliciting materials, or installing public art. 

Capstone students apply scholarly under- 
standing developed in other classes to solve 
project problems. They must also take the crit- 
ical step of conveying that knowledge and its 
application to team members and the public. 
They learn to communicate the scientific basis 
of their restoration concept to clients, peers in 
various disciplines, and the public. 


Our Experiences and Outcomes 

Over the past 6 years, 155 students from all 
three UW campuses have participated in the 
UW-REN capstone. Of these, 95 have also 
pursued and been awarded the Restoration 
Ecology Certificate, which requires addi- 
tional coursework. The interest in applying 
academic knowledge to restoring damaged 
ecosystems occurs across many university 
departments (6, /0), although the natural sci- 
ences dominate our student population (table 
S3). UW-REN began with courses and faculty 
from natural science departments. Students 


from these programs quickly discover UW-REN 
offerings and are already inclined toward 
environmental themes. With the natural sci- 
ence foundation in place, we plan to engage a 
broader set of students and faculty and to add 
content and skills from fields such as ethics, 
anthropology, sociology, environmental his- 
tory, and policy (/1). Faculty will be recruited 
who can help students consider such social 
and humanistic dimensions. 

UW-REN offers a model for collaborative 
education among science and nonscience stu- 
dents. We feel it has been effective in (i) fostering 
their ability to apply scientific understanding to 
practical problems in the field, (ii) tackling chal- 
lenges in a multidisciplinary context and apply- 
ing their knowledge in a framework with other 
disciplines, and (iii) developing their abilities to 
communicate broadly. Through this process, we 
have watched students come to understand that 
science is necessary but not sufficient for suc- 
cessful restoration (6) and that success depends 
on effective communication and cross-cutting 
alignment of values of the project team, clients, 
and the surrounding community (/2). Lessons 
learned in working with community clients have 
improved our ability to prepare both students and 
clients for the capstone (/3). 

Our experience has not been without insti- 
tutional challenges. Although we have had 
some success, we face repeated challenges in 
developing ongoing support in terms of funds, 
faculty time, and resources (/4). 

Students have appreciated how the cap- 
stone process reflects the stresses of team- 
based problem-solving for clients. As one stu- 
dent explained, “The varied perspectives and 
priorities of our team and clients challenged us 
to find common ground. As a result, we 
learned to collaborate in order to develop cre- 
ative objectives and solutions to accomplish 
our hybrid goals. The UW-REN Capstone 
series is a uniquely holistic way of learning that 
combines the teaching and learning of the the- 
ory, application, and social aspects of science.” 
The experience of addressing real-world prob- 


INTERDISCIPLINARY CONTRIBUTIONS TO PROJECT GOALS 


Maintaining biodiversity 


Fields contributing 
disciplinary knowledge 


Conservation biology; ecology; 
landscape architecture 


Relevant literature on basic 
science 


Habitat and species diversity; 
ecosystem stability 


Wildlife and bird habitat 
restoration 


Relevant literature on 
real-world applications 


Examples of solutions Woody debris addition; 


microtopography; nest boxes 


Creating solutions through scientific knowledge. 


Goals 


Controlling erosion Enhancing sustainability 


Civil engineering; botany; 
geology; soil science 


Ecology; education; business; 
social sciences 


Soil properties; slope 
stability 


Succession; nonequilibrium 
coexistence; assembly rules 


Erosion control and 
stabilizing slopes 


Successional trajectories 


Bioengineering; wood 
structure installation 


Maintenance; monitoring; 
stewardship 


Students call upon knowledge from the basic and 


applied literature of a variety of disciplines as they craft solutions to the project's challenges. Table $2 


provides literature references and more examples. 
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lems helps students with future challenges 
(SOM text). Faculty benefit from new teaching 
and research collaborations. Finally, capstone 
projects forge rich bonds between the univer- 
sity and community, and provide exemplars of 
college courses that deal effectively with 
urgent societal issues and produce workable 
solutions (/5). That the community also finds 
these projects valuable is reflected in our 
repeat customer rate: More than 40% of 
our capstone projects have been with clients 
returning after previous projects (SOM text). 
We believe this capstone approach shows 
how research universities can become engaged 
with surrounding human and ecological com- 
munities while providing students with real- 
world opportunities to use their education. 
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CELL SIGNALING 


H,0,, a Necessary Evil for 


Cell Signaling 


Sue Goo Rhee 


was viewed as the inevitable but unwanted 

by-product of an aerobic existence. Given 
the damage inflicted by H,O,, it was assumed 
that the faster the elimination of this toxic waste, 
the better for the cell. However, as highlighted in 
recent forums (/, 2), we now know that mam- 
malian cells produce H,O, to mediate diverse 
physiological responses such as cell prolifera- 
tion, differentiation, and migration (3, 4). This 
has led to implications of cellular “redox” sig- 
naling in regulating normal processes and dis- 
ease progression, including angiogenesis, oxida- 
tive stress and aging, and cancer. This changing 
view of H,O, has partly evolved from a clearer 
understanding of redox chemistry as it affects 
biology—that is, cellular signaling that is linked 
to reductive-oxidative—based mechanisms. As 
the components and mechanisms involved in 
performing cellular redox chemistry become 
better defined, new areas of research are emerg- 
ing as to how the cells spatially and temporally 
channel H,O, into specific signaling pathways to 
achieve desired cellular outcomes. 

H,0, production has been studied most exten- 
sively in neutrophils. These immune cells defend 
a host against infections by engulfing and killing 
foreign microorganisms. The system relies on 
Nox [the NADPH (reduced form of nicotinamide 
adenine dinucleotide phosphate) oxidase com- 
plex], which generates millimolar quantities of 
HO, within the safe confines of an organelle 
[phagosome (see the figure)] for the purpose of 
microbial killing. In the classical phagocyte para- 
digm, stimulation of neutrophils by invading 
microoganisms leads to assembly at the plasma 
membrane of an active Nox complex, which com- 
prises a catalytic subunit—the integral membrane 
protein gp91 Phox—and regulatory proteins 
including the small guanosine triphosphatase 
Rac. This complex releases the reactive oxygen 
species superoxide (the free radical anion O, ) 
into the phagosome, and superoxide dismutation 
yields another reactive oxygen species, H,O,. 

We do know that in nonphagocytic cells, 
HO, affects numerous intracellular signaling 
pathways. Nonphagocytic cells express gp91 
Phox and its homologs (5), and these proteins are 
the major source of H,O, in cells stimulated with 
various growth factors and cytokines including 
platelet-derived growth factor (PDGF), epider- 


F or many years, hydrogen peroxide (H,O,) 
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Once considered lethal to cells, reactive 
oxygen species are now known to be involved 
in redox signaling pathways that may 
contribute to normal cell function as well 

as disease progression. 
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CELL SIGNALING 


H,0, production, protection, and signaling actions. Activation of various cell surface receptors activates Nox 
situated either in the plasma membrane or in the membrane of organelles such as endosomes to produce H,0,. 


To function as an intracellular signaling molecule, H,0.,, must be imported into the cytosol. Cytosolic H,0, 


enhances protein tyrosine phosphorylation by inactivating protein tyrosine phosphatases while activating protein 
tyrosine kinases. Transient protection of the H,O, signal from abundant cytosolic peroxiredoxin appears to result 
from the reversible inactivation of these enzymes through either hyperoxidation or phosphorylation. 


mal growth factor (EGF), insulin, tumor necrosis 
factor-a (TNFa), and interleukin-1 (IL-1) (3, 
4). However, the coupling of receptor activation 
to Nox activation in nonphagocytic cells still 
remains poorly understood. 

We are also trying to understand the mecha- 
nisms by which H,O, can modify the activity of 
key signaling proteins. Biological redox reac- 
tions catalyzed by H,O, typically involve the oxi- 
dation of cysteine residues on proteins, which 
may affect protein function. Phosphorylation of 
tyrosine residues in proteins is governed by the 
opposing activities of protein tyrosine phos- 
phatases and protein tyrosine kinases. The pro- 
tein tyrosine phosphatase family features a com- 
mon Cys-X-X-X-X-X-Arg active-site motif 
(where X = any amino acid). As a result of the 
invariant arginine, the conserved catalytic cys- 
teine possesses a low pK, (where K, is the acid 
dissociation constant) and exists as a thiolate 
anion with enhanced susceptibility to oxidation 
by H,O,. Oxidation of the essential cysteine 


abolishes phosphatase activity and can be 
reversed by cellular thiols. Reversible inactiva- 
tion of several different protein tyrosine phos- 
phatases has been demonstrated in relevant cell 
types stimulated with PDGE, EGE insulin, extra- 
cellular matrix molecules, and B cell receptor 
ligands (3, 6). Oxidative inactivation of these 
phosphatases and increased tyrosine phosphoryl- 
ation of target proteins were found to be depend- 
ent on H,O, production. Moreover, in TNF- 
a—stimulated cells, the resulting H,O, that is 
generated inactivates mitogen-activated protein 
kinase phosphatases. This in turn results in sus- 
tained activation of c-Jun N-terminal kinase, a 
subfamily of the mitogen-activated protein 
kinases that elicits specific cellular responses. 
H,0, also appears to promote tyrosine phos- 
phorylation by activating protein tyrosine 
kinases. For example, upon cell attachment to 
extracellular matrix and associated generation of 
H,0,, the tyrosine kinase Src becomes oxidized 
at two cysteine residues and thus becomes acti- 
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vated (7). Moreover, antioxidant treatment of 
cells that express an oncogenic form of Src 
(v-Src), or mutation of the oxidation-sensitive 
cysteine residues of v-Src, reduces the potency 
of v-Src to transform cells. This redox-dependent 
activation of Src occurs alongside dephosphoryl- 
ation of a carboxyl-terminal tyrosine, a modifica- 
tion that is needed to activate Src. 

For H,O, to serve as a signal—through 
modification of signaling proteins—its con- 
centration must increase rapidly above a cer- 
tain threshold. How can this occur in the pres- 
ence of antioxidant enzymes such as catalase, 
glutathione perioxidase, and peroxiredoxin? 
Whereas catalase is confined to the peroxi- 
some, several peroxiredoxin isoforms are 
abundant in the cytosol. Therefore, H,O, must 
be protected from destruction by peroxire- 
doxin in selected contexts. Indeed, multiple 
protective mechanisms of this type are being 
uncovered. During catalysis of H,O, reduc- 
tion, the active-site residue, Cys-SH, of perox- 
iredoxin occasionally reacts with two mole- 
cules of H,O,, and thus becomes hyperoxi- 
dized to Cys-SOOH. Consequently, peroxire- 
doxins are inactivated (8). This inactivation, 
which can be reversed by sulfiredoxin, an 
adenosine triphosphate—dependent enzyme, 
may represent a built-in mechanism to prevent 
damping of the H,O, signal. Prokaryotes do 
not express sulfiredoxin and their peroxiredox- 
ins are resistant to hyperoxidation. Thus, this 
regulatory mode appears unique to eukaryotes. 
Peroxiredoxins are also reversibly inactivated 
upon phosphorylation by cyclin B—dependent 
kinase during mitosis (9). 

Given the toxicity of H,O,, spatial and tem- 
poral regulatory strategies must exist to ensure 
that Nox activation occurs only where needed 
and that the H,O, signal is terminated in a timely 
fashion. Recent work on cells stimulated with 
TNF-a. suggests that Nox proteins are assem- 
bled in specific subcellular compartments within 
membranes such as lipid rafts (10). Localized 
Nox assembly also occurs at focal complexes, 
points of contact between a moving cell and the 
extracellular matrix, in response to migratory 
stimuli (//). The relevant oxidation targets that 
are presumably enriched in these microenviron- 
ments remain to be identified. 

Despite the increasingly sophisticated molec- 
ular descriptions of H,O, action, disturbingly lit- 
tle is understood about how H,O, is actually 
delivered to the cytosol. The classical neutrophil 
studies demonstrate that Nox releases H,O, into 
the phagosome, which is topologically equiva- 
lent to the extracellular space. How, then, does 
H,O, modulate intracellular signaling? In one 
scenario, Nox situated at the plasma membrane 
releases H,O, into the extracellular space as an 
autocrine factor to be imported into the cell. 
Alternatively, Nox proteins assembled at 
organelle membranes discharge H,O, into the 
lumenal space. For example, binding of IL-1 to 


its receptor in the plasma membrane triggers 
Rac-mediated Nox association with the IL-1 
receptor and endocytosis (internalization) of the 
receptor complex (/2). This results in superoxide 
production and conversion into H,O, in the 
lumen of the endosome. In addition, Nox iso- 
forms and their regulatory subunits have been 
detected in other cell organelles including the 
endoplasmic reticulum and nucleus. 

Regardless of whether the Nox complex 
is activated at the cell surface or within an 
organelle, the resultant H,O, must traverse the 
lipid bilayer to access the cytosol, where most if 
not all of its target proteins exist. Although H,O, 
is believed to diffuse freely across membranes, 
recent studies indicate that some membranes are 
poorly permeable to H,O,. Instead, H,O, trans- 
port might be regulated by changes in membrane 
lipid composition or by aquaporins (/3), which 
are diffusion-facilitating channel proteins for 
noncharged solutes such as water. 

The current picture of H,O, -based redox 
regulation of signaling processes is rapidly 
expanding beyond those issues focused on 
here. The development of a sensitive and spe- 
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cific probe for H,O, that allows quantitative 
and dynamic assessment in live cells, conspic- 
uously lacking in studies to date, will be a great 
boon for the study of this misunderstood and 
maligned molecule. 
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Seeking Room-Temperature 
Ferromagnetic Semiconductors 


Koji Ando 


Microelectronic circuits that retain their logic state when the power is off would permit entirely 
new kinds of computers. Ferromagnetic semiconductors might make this technology possible. 


onvolatile digital circuits that retain their 
Nee states even when their power 

sources are rapidly switched on and off 
would make possible a new type of computer. 
Although appearing to operate normally, these 
devices would actually be turned off most of the 
time, potentially changing the way we use digital 
devices. Such devices would allow, for example, 
year-long operation of mobile computers, an 
enormous number of tiny computers embedded 
all around us to help our daily lives, and ultra- 
high-density integrated circuits free from heat 
generation problems. To make this dream a real- 
ity, nonvolatile transistors are needed, but unfor- 
tunately this technology is nonexistent. Because 
transistors are composed of semiconductor 
materials, the ideal way to make nonvolatile 
transistors would be to use semiconductor mate- 
rials that are intrinsically nonvolatile. 
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Among several physical phenomena that 
produce nonvolatility, the most enticing is that 
of ferromagnetic hysteresis. In this effect, 
the material retains its magnetic state until 
reversed by a suitable magnetic field. Ferro- 
magnetism has been verified to offer high- 
speed, unlimited magnetization reversal, so it 
is perfect for transistor applications. However, 
the ferromagnetic materials used in digital 
devices such as hard disks and magnetic ran- 
dom access memory chips—iron, cobalt, and 
nickel and their alloys—are not semiconduc- 
tor materials. Hence, there is a continuing 
search for semiconductor materials that dis- 
play ferromagnetic properties. 

By replacing some of the positive ions of the 
parent nonmagnetic semiconductors by magnetic 
ions, one can make ferromagnetic semiconduc- 
tors such as (In,Mn)As and (Ga,Mn)As (/). But 
their ferromagnetic Curie temperatures (7_}—the 
temperature at which the ferromagnetism disap- 
pears—are 61 K (2) and 173 K (3), respectively, 
much lower than room temperature. In 2000, Dietl 
etal. (4) used a simple theory to estimate the 7, of 
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ferromagnetic semiconductors, and they pre- 
dicted that room-temperature ferromagnetic 
semiconductors might be created by substituting 
manganese ions in wide-band gap semiconduc- 
tors such as GaN and ZnO. Reports of GaN- and 
ZnO-based room-temperature ferromagnetic 
semiconductors (5, 6) soon followed. A Curie 
temperature as high as 940 K was reported for 
GaN with less than 10% manganese (5). Before 
long, reports began appearing of room-tempera- 
ture ferromagnetic semiconductors extended to 
materials based on other useful oxide insulators 
such as TiO, (7). Eventually there appeared a 
report of room-temperature ferromagnetism of 
pure HfO, (8), which contains no magnetic ions at 


Energy 


Nonmagnetic semiconductor 


Conduction band (s) 


Valence band (p) 


ductor” is typically only 10° emu. Still, this level 
of weak magnetization can be easily detected by 
extremely high-sensitivity SQUIDs (super- 
conducting quantum interference devices). 
However, careful attention should be paid to the 
fact that even tiny amounts of iron—as little as 1 
part per 2000 of the typical sample volume used 
in SQUID measurements—can generate these 
kinds of magnetic signals. A recent report (/3) 
cautioned that nonmagnetic HfO, thin films 
generate clear, ferromagnetic signals after con- 
tact with stainless steel tweezers. Because new, 
unexpected ferromagnetic materials can be syn- 
thesized in the nonequilibrium growth method, 
the possible presence of very small amounts of 


(d) 


Magnetic semiconductor 


Magnetism in semiconductors. (Left) In normal, nonmagnetic semiconductors, electronic energy does not 
depend on the spin direction. It is not possible to distinguish between spin-up and spin-down electrons. 
(Right) In magnetic semiconductors, the d electrons of magnetic ions influence the s and p electrons, and 
the conduction band and valence band are split depending on the spin direction (Zeeman splitting). This 
spin-polarized semiconductor band structure alters the absorption of clockwise-polarized (o*) and counter- 
clockwise-polarized (07) light (the MCD effect). For this reason, a magnetic semiconductor should display an 
MCD spectrum that reflects the band structure of the parent semiconductor. 


all. In the physics world it was long held that fer- 
romagnetism at room temperature could only be 
achieved with materials containing very high con- 
centrations of magnetic ions, so these reports 
came as a big surprise. 

Are we close to realizing the dream of non- 
volatile transistors? Half a decade has already 
passed since the first report of the high-T, ferro- 
magnetic semiconductor, but debate on the 
nature of the observed ferromagnetic signals still 
continues to rage (9—/2). Now that the excite- 
ment has passed, efforts are increasingly shifting 
to the task of thinking calmly about ways to eval- 
uate ferromagnetic semiconductors. 

The concentration of magnetic ions that can 
be introduced into a parent semiconductor is at 
most 10%, even with the use of a nonequilibrium 
film growth method. For this reason, the magne- 
tization displayed by thin-film specimens of 
what is claimed to be a “ferromagnetic semicon- 


an impurity phase is a critical problem that can- 
not be avoided in research on ferromagnetic 
semiconductors. Crystallographic evaluation 
methods, such as x-ray diffraction and transmis- 
sion electron microscopy, are often used in 
claims of the absence of impurities, but their 
detection sensitivity is extremely low relative to 
that of SQUID, and so they cannot be taken to 
provide conclusive evidence. 

To clear up this confusion, it is useful to con- 
sider the essential character of a magnetic semi- 
conductor. If magnetic properties and semicon- 
ductor properties existed independently of each 
other, magnetic semiconductors would have 
absolutely no value. It is the mutual interaction 
between magnetic properties (supported by the d 
electrons of the magnetic ions) and the semicon- 
ductor properties (supported by the s and p elec- 
trons) that is the essence of a magnetic semicon- 
ductor. With this s, p-d exchange interaction, 


one can control the magnetization by the electri- 
cal field (/4) or control the semiconductor opti- 
cal characteristics by the magnetic field (/5), for 
example. Because of the s, p—d exchange inter- 
action, the energies of the s and p electrons in a 
magnetic semiconductor become dependent on 
their spin state [up or down (see the figure)]. 
Therefore, confirmation of a spin-polarized 
semiconductor band is the litmus test of the exis- 
tence of a magnetic semiconductor. 

As a result of the quantum mechanical selec- 
tion rules for optical absorption, the spin polar- 
ization of a semiconductor band is directly con- 
nected with the appearance of magnetic circular 
dichroism (MCD) (/5). MCD is an effect in 
which clockwise-polarized and counterclock- 
wise-polarized light are absorbed differently. 
The important point here is that, whereas all 
magnetic materials can produce an MCD signal, 
a magnetic semiconductor should not only have 
a strong MCD signal but should also display an 
MCD spectral shape that reflects the band struc- 
ture of the parent semiconductor (/5). If the 
MCD spectral shape is different from what is 
expected, it is possible that the detected MCD 
signal is being produced by a magnetic material 
other than the expected magnetic semiconduc- 
tor. When the magnetic field dependence of the 
MCD signal of a magnetic semiconductor 
behaves ferromagnetically, we can conclude that 
we have a ferromagnetic semiconductor. 

Up to now, studies of the spin-polarized 
semiconductor band structures of materials 
claimed to be “ferromagnetic semiconductors” 
have been limited despite their importance. In 
the cases of GaN:Mn, GaAs:Cr, and ZnO:Ni— 
all of which produced ferromagnetic SQUID 
signals—no magnetic MCD signals were 
observed (16). It is very likely that some material 
other than a magnetic semiconductor was 
responsible for the observed ferromagnetic 
SQUID signal. The MCD spectral shape dis- 
played by a ferromagnetic ZnO:Co was very dif- 
ferent from that of paramagnetic (Zn,Co)O (/6). 
In the case of Ti0,:Co, no clear correspondence 
between the shape of the ferromagnetic MCD 
spectra (/7) and the expected band structure has 
been established. On the other hand, MCD spec- 
tral analysis has verified that (In,Mn)As (/8), 
(Ga,Mn)As (19), and (Zn,Cr)Te (20) are intrinsic 
ferromagnetic semiconductors. 

To make one’s way in a wilderness, a com- 
pass is necessary. In the quest for ferromagnetic 
semiconductors, that compass is MCD analysis 
and theoretical materials design. Recently, the 
number of experimental MCD studies has been 
increasing, but hardly any theoretical work has 
been done on interpreting the spectral shapes in 
MCD. On the other hand, there have been sub- 
stantial improvements in theories for predicting 
the 7, of ferromagnetic semiconductors (2/, 22). 
It seems that the wide-band gap semiconductors 
do not necessarily have an advantage in achiev- 
ing high 7. An alternative would be to look more 
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closely at materials having a longer exchange 
interaction distance between magnetic ions anda 
higher concentration of magnetic ions. All of the 
(In,Mn)As, (Ga,Mn)As, and (Zn,Cr)Te materi- 
als belong to this category. Recently, the T, of 
(Ga,Mn)As has been increasing from an initial 
value of 110 K (/) up to 173 K (3) by improving 
the growth procedures, and that of (Zn,Cr)Te has 
reached 300 K (20). 

Many different characteristics are considered 
when searching for the ferromagnetic semicon- 
ductors needed for nonvolatile transistors, 
including high 7., high carrier mobility, and 
intrinsic insulation conditions for carrier doping. 
To find our goal, it seems we must continue wan- 
dering, with the compass of MCD in one hand. 
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CHEMISTRY 


Targeting Specific C-H Bonds 


for Oxidation 


Rubén Mas-Ballesté and Lawrence Que Jr. 


main challenge in chemistry is the 
Ase of catalysts that can carry out a 

desired chemical transformation only 
upon chosen specific targets. These specific tar- 
gets can be one molecule in a mixture of chemi- 
cal substances or—perhaps even more diffi- 
cult—one specific site in a molecule with sev- 
eral reactive sites. On page 1941 of this issue, 
Das etal. (1) report an innovative approach to the 
design of a bio-inspired catalyst for the highly 
selective oxidation of CH bonds. 

Nature uses different strategies to achieve 
selective oxidations. For instance, the enzyme 
methane monooxygenase (MMO) selectively 
oxidizes methane to methanol, even in the pres- 
ence of other hydrocarbons with weaker C-H 
bonds (2). This specificity arises from a sieve- 
like phenomenon that hampers the access of 
molecules bigger than methane to the active site 
of the enzyme (see the figure, left panel) (3, 4). 
Besides size (and shape), enzymes can confer 
specificity by means of noncovalent interactions 
that orient the substrate in a particular manner 
(see the figure, right panel). Fatty acid desat- 
urases use a functional group within the enzyme 
active site to anchor the carboxylate end of a 
fatty acid and achieve dehydrogenation at two 
specific carbon atoms (5). 

Finding inspiration in nature, chemists have 
designed synthetic systems to emulate the 
specificity of biological oxidations. An initial 
approach reported by Breslow used steroid sub- 
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strates with a dangling, covalently 
attached photosensitive group to 
carry out light-activated remote oxi- 
dations at particular sites of the sub- 
strate (6). In the same vein, Stuk et 
al. appended a substrate to a man- 
ganese catalyst to achieve selective 
oxidation at specific sites of the sub- 
strate (7). These examples demon- 
strated the possibility of selective 
oxidations in a biomimetic system, 
but the covalent binding of substrate 
to the oxidant constrained the chem- 
istry to a maximum of a single 
turnover, such that only stoichio- 
metric (rather than catalytic) oxida- 
tion could be achieved. 

Subsequent efforts have incor- 
porated into the catalyst design moi- 
eties that can recognize substrates 
through noncovalent interactions. 
An approach extensively explored 
in the past used a metalloporphyrin 
as the catalytic center, with modifi- 
cations at its periphery to confer 
substrate selectivity. 

For example, Bhyrappa et al. 
attached dendrimers to the metal- 
loporphyrin to favor the oxidation 
of less sterically hindered linear al- 
kanes over cyclic alkanes (8); this 
approach emulates the size-recog- 
nition strategy of MMO. Mimick- 
ing the anchoring strategy of 
fatty acid desaturases, Groves and 
Neumann (9) embedded a metallo- 
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Enzymes use two main strategies—size/shape 
selectivity and substrate orientation—to 
achieve selective oxidation. Chemists have 
now developed a simple catalyst that mimics 
substrate orientation. 


Size/shape recognition 
(methane monooxygenase) 


Substrate orientation 
(desaturases) 


Size/shape recognition versus substrate orientation. Schematic 
representation of the two main strategies used by enzymes to 
achieve selectivity. 
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porphyrin catalyst into a phospholipid bilayer 
that selected for hydrophobic substrates and 
oriented them in such a way as to attain regio- 
selective epoxidation of sterols and polyunsatu- 
rated fatty acids. Combining both size-recogni- 
tion and substrate-orientation design elements 
(see the figure), Breslow et al. attached B- 
cyclodextrin groups to the metalloporphyrin 
periphery to bind substrates with appropriately 
sized and suitably positioned hydrophobic 
groups (/0). This binding step served to bring 
only the target C—H bond into close proximity 
to the metal center; hundreds to thousands of 
turnovers could be achieved (//). 

Das et al. now report the design and synthe- 
sis of a new bio-inspired catalyst for the highly 
selective oxidation of C—H bonds. Instead of a 
metalloporhpyrin, they have used a dinuclear 
manganese complex as the catalytic center, 
where the Mn ions are coordinated to a triden- 
tate ligand to which a carboxylic acid group (the 
tweezers in the right panel) is attached via a suit- 


ably rigid framework (the tether in the right 
panel). This group can interact via hydrogen 
bonds with the carboxylic group of the sub- 
strate, orienting it in such a way that only one 
oxidizable site is in the correct position to be 
attacked by the metal center. Following this 
strategy, not only is a high regioselectivity 
achieved but also an excellent stereoselectivity 
for the oxidation of a substrate that can adopt 
different conformations. The system reported in 
this issue mimics very efficiently (up to 700 cat- 
alytic turnovers have been observed) the strat- 
egy used by fatty acid desaturases with a simple 
and elegant catalyst design. 

The relative simplicity of the ligand 
design used by Das ef al. allows us to envis- 
age a new horizon of modified ligands. By 
tuning the nature of the tweezer and the 
length of the tether, a whole family of cata- 
lysts could be tailored to accommodate a 
wide range of substrates. This work, together 
with the recent results reported by Breslow 


and co-workers (/0, //), demonstrates the 
viability of molecular recognition strategies 
for catalytic oxidations. 
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MOLECULAR BIOLOGY 


Accurate RNA Siting and 
Splicing Gets Help from a DEK-Hand 


Tracy L. Kress and Christine Guthrie 


ukaryotic genomes are very economical, 
E= the information contained in a single 
genome is usually greater than the sum of 
its genes. Different proteins can be generated 
from a single gene [specifically, from the initial 
precursor messenger RNA (pre-mRNA) that is 
synthesized from a single gene] through a com- 
plex splicing process. By excising different 
introns and splicing together different exons— 
the noncoding and 
coding portions of 
transcribed pre- 
mRNA, _respec- 
tively—a gene can 
expand its repertoire 
of mRNA-encoded products. But this relies on 
the exquisite accuracy of the cellular “splicing 
machinery” to accomplish this task. On page 
1961 of this issue, Menedes Soares et al. (/) 
report that accurate identification of splice sites 
involves a factor that is not part of the known 
splicing machinery that carries out the cutting 
and pasting of mRNA, but is associated with 
chromatin, the material of chromosomes. 
The spliceosome is a large and dynamic 
ribonucleoprotein machine that removes introns 
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from pre-mRNAs (2). Understanding how the 5’ 
and 3’ ends of introns are accurately identified, 
despite relatively degenerate consensus seq- 
uences, remains a fundamental challenge. Errors 
of even a single nucleotide in the specification of 
an intron boundary can have catastrophic conse- 
quences—for example, by creating a frameshift 
mutation. This problem is magnified by the 
widespread existence of alternative splicing, in 
which a given splice site can be recognized in 
one biological context and ignored in another. 
Thus, the spliceosome must have evolved mech- 
anisms to balance precision and plasticity in 
splice site selection. Extensive genetic analyses 
in budding yeast have revealed a number of fac- 
tors that, when altered, degrade fidelity of splice 
site choice by allowing the use of mutated intron 
boundaries (3—7). To date, all of these factors are 
components of the core splicing machinery. 
Mendes Soares ef al. now report that DEK, a 
known chromatin-associated protein frequently 
overexpressed in tumor cells (8), plays a key role 
in splice site recognition. 

The 3’ ends of introns in metazoa are defined 
by a composite signal: a polypyrimidine (Py) tract 
upstream of an AG dinucleotide that is the 3’ 
splice site (see the figure). These sequences are 
recognized by the RNA-binding proteins U2AF® 
(9) and U2AF*° (/0-12), respectively, which can 
form a heterodimer. When the Py tract is suffi- 


Mature messenger RNA is prepared in 
eukaryotes by splicing together the protein- 
coding sequences of precursors. A chromatin- 
associated protein proofreads these splice sites 
in the RNA, ensuring accuracy. 


ciently long, the AG sequence is not required until 
the second of the two chemical steps involved in 
splicing (hence, the term AG-independent) (10, 
13, 14). In contrast, so-called AG-dependent 
introns, which have short or interrupted Py tracts 
(13), require U2AF* binding to promote or stabi- 
lize the binding of U2AF® to the weak Py tracts 
(10, 14, 15). Earlier work (10-12, 14) indicated 
that U2AF* might be sufficient to confer 
sequence specificity for the AG dinucleotides. 
However, Mendes Soares ef al. now report that 
purified U2AF* can promote binding of purified 
U2AF® to artificial RNAs containing a Py tract 
and either a wild-type 3’-AG or a mutated 3’ splice 
site, indicating that U2AF* alone is not sufficient 
to differentiate between the two. Although the 
authors do not pinpoint differences between the 
present and published assays for splice site dis- 
crimination that might account for this discrep- 
ancy, they show that specificity for the correct AG 
dinucleotide splice site can be restored upon addi- 
tion of a nuclear extract, made from a human cell 
line, that is depleted of the U2AF heterodimer. 
This provides the first evidence that an additional 
factor or activity is required for maximal 3’ splice 
site specificity. 

To identify this mystery factor, the authors 
used an ultraviolet light-induced oligonucleotide 
cross-linking assay to monitor 3'-AG splice site 
discrimination activity during the purification of 
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U2AF from nuclear extracts. They isolated a 50- 
kD protein identified by mass spectrometry 
as DEK, a protein originally associated with 
leukemia (8). Depletion of DEK from a nuclear 
extract reduced the ability of endogenous U2AF 
to discriminate between CG and AG dinv- 
cleotides, and this activity was substantially 
restored by the addition of recombinant DEK 
protein. Moreover, an interaction between in 
vitro-synthesized U2AF* and recombinant 
DEK protein was observed. DEK is known to be 
phosphorylated in cells; recombinant DEK is not 
phosphorylated but may become so upon addi- 
tion to nuclear extract or when added to in 
vitro-synthesized U2AF*. The authors show 
that phosphorylation of DEK is required for this 
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of DEK from nuclear extracts did not lead to 
splicing at mutant 3’ splice sites in either of two 
pre-mRNAs tested (one AG-dependent and one 
AG-independent). Notably, however, these 
DEK-depleted extracts failed to splice wild- 
type RNA substrates. This defect could be res- 
cued by the addition of recombinant DEK. 
Interestingly, removal of the AG-independent 
intron was blocked specifically at the second 
chemical step, arguing that in the absence of 
DEK, spliceosomes could still assemble and 
carry out first-step chemistry. In this case, 
DEK’s role would be to ensure that an AG dinu- 
cleotide has been identified at the 3’ splice site 
before allowing the chemical reactions to pro- 
ceed, as has previously been shown for U2AF** 
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RNA splice site discrimination requires DEK. (Top) Phosphorylated (P) DEK, which interacts with the splic- 
ing factor U2AF?>, allows accurate selection of the 3’-AG dinucleotide sequence and allows U2AF®° to rapidly 
assemble on the precursor mRNA. This may outcompete a precursor mRNA that contains a mutated 3’ splice 
site. (Bottom) It is conceivable that DEK, which associates with chromatin, may coordinate chromatin remod- 
eling/transcriptional regulation and mRNA splicing. Phosphorylation may operate as a switch for DEK 


between these two functions. 


interaction as mutation of two known serine 
phosphorylation sites in DEK both abolished 
DEK phosphorylation and inhibited its interac- 
tion with U2AF*>. When the authors repeated the 
cross-linking assay with recombinant DEK and 
purified U2AF heterodimer, maximal splice site 
discrimination activity required treatment of 
DEK with nuclear extract (which was depleted of 
DEK and U2AF through biochemical fractiona- 
tion). The authors speculate that the extract may 
contain a kinase required to phosphorylate DEK, 
and show that recombinant DEK can be phos- 
phorylated upon addition to a nuclear extract. 
Indeed, the increase in 3’ splice site discrimina- 
tion correlated with phosphorylation of DEK, 
whereas the phosphorylation-defective DEK 
mutants showed decreased discrimination. These 
results suggest that phosphorylation of DEK pro- 
motes its association with U2AF*°, which in turn 
enhances AG dinucleotide discrimination by the 
U2AF heterodimer. 

Does DEK function solely to suppress the 
use of noncanonical 3’ splice sites? Depletion 


(10, 14, 15). To test this hypothesis directly, the 
authors assayed an early step in spliceosome 
assembly—the binding of U2 small nuclear 
ribonucleoprotein (snRNP) to pre-mRNA or 
oligonucleotides containing a 3’ splice site. 
When a 3’-AG-containing RNA was used, U2 
snRNP binding was unaffected by the absence 
of DEK. However, when snRNP binding was 
assayed in the presence of RNA competitors, an 
AG-containing RNA could outcompete a CG- 
containing RNA, as long as phosphorylatable 
DEK was present. By this criterion, DEK might 
function early in spliceosome assembly, to 
enhance the discrimination between wild-type 
and noncanonical 3’ splice sites. 

The authors conclude that DEK acts to 
proofread the 3’ splice site. How does this fit 
into the current view of fidelity mechanisms? 
Proofreading factors identified to date play 
important roles in the splicing of wild-type 
introns (3). That is, like DEK, their function is 
not restricted to fidelity per se. As components 
of the splicing machinery, they influence the 
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rate of interconversion between alternative 
conformations of the spliceosome during the 
chemical steps of splicing, likely underlying 
the extent to which aberrant splice sites are 
used (3, 6). These conformational changes are 
often coupled to an irreversible step, such as 
the hydrolysis of ATP (adenosine triphosphate) 
by DEAD-box RNA-dependent ATPases, in a 
kinetic proofreading scheme (7). As the 
authors point out, it will be exciting to investi- 
gate the possibility that DEK is involved in 
kinetic proofreading by perhaps modulating 
the activity of an ATPase. Interestingly, recent 
results from yeast implicate Prp22, a DEAD- 
box RNA-dependent ATPase splicing factor, in 
the rejection of mutated 3’ splice sites immedi- 
ately before exon ligation. By slowing an ATP- 
dependent “timer,” mutations that reduce the 
ATPase activity of Prp22 can allow aberrant 
introns to be spliced (6). 

Perhaps the most provocative questions 
raised by this work involve the potential link 
between splicing and chromatin remodel- 
ing/transcriptional regulation, given that DEK 
is known to be chromatin-associated. In- 
triguingly, phosphorylation of DEK, which is 
required for splicing activation, reduces its 
affinity for DNA (/6). Conceivably, the phos- 
phorylation state of DEK acts in a switchlike 
fashion to alter the association of DEK 
between proteins involved in chromatin re- 
modeling and transcriptional regulation and 
proteins involved in RNA splicing. It will also 
be important to determine whether DEK plays 
a role in alternative splicing, and if so, how 
phosphorylation may influence this activity. In 
addition to elucidating the role of DEK in the 
fidelity of splice site choice, such information 
could help explain how the misregulation of 
DEK might underlie disease. 
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The Neutron Spin- 
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Improvements in neutron scattering 
techniques allow precise measurement of 
dynamics in magnetically ordered condensed 
matter materials. 
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dramatic manner when these 
particles form a periodic 
solid. Although the energy E 
of a free particle of mass 
relates to its velocity V (or 
momentum P = MV) through 
the well-known relation 
E = MV?/2 (or E= P?/2M), in 
a crystal this relation is re- 
placed by the more general 
relation E(P), which is called 


Cie 


aC 


a dispersion relation. This dis- Spin oriented Spin oriented Spin oriented Rotated by x 
persion relation can take very parallel to perpendicular after precession flipper around 
: . guide field to guide field in yz plane y axis 
intricate forms that encode inzdirection iny direction 


the periodic environment as 
well as the interactions bet- 
ween the elementary building 
blocks of the solid. Since the 
work of Landau on Fermi liq- 
uids (/), this dispersion rela- 
tion has been recognized as a basic and essential 
concept in understanding weakly interacting 
quasiparticles, such as, among others, phonons 
responsible for lattice vibrations, electrons 
responsible for electrical conductivity, and 
magnons responsible for spin waves. Now, as 
reported on page 1926 of this issue, Bayrakci et 
al. (2) have taken the measurement of the proper- 
ties of such quasiparticles to a new level of 
sophistication. 

The validity of Landau’s approach has been 
directly verified for electrons by means of angle- 
resolved photoelectron spectroscopy (ARPES) 
and for phonons with both inelastic neutron and 
x-ray scattering techniques. Magnon dispersions 
can only be measured by inelastic neutron scat- 
tering, however, which therefore represents the 
technique of choice to validate or invalidate our 
current theoretical understanding of magnetic 
phenomena in condensed matter physics. 

True noninteracting quasiparticles can exist 
only at both zero temperature and at the mini- 
mum of the dispersion relation, where no decay 
can occur. These noninteracting particles are 
characterized by infinite lifetime, which is 
related, through the quantum uncertainty princi- 
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ple, to a zero linewidth in an energy spectrum 
(energy width). As one moves away from the 
energy minimum, scattering between quasiparti- 
cles (phonon-phonon, electron-electron, magnon- 
magnon, or mixed interactions) becomes possible 
and, as a consequence, their lifetimes diminish 
and their energy spectral widths increase. 
Experimentally, it is much more difficult to 
determine lifetime effects than dispersion rela- 
tions. This is mainly a result of the fact that in 
most conventional scattering experiments, such 
as those that use neutron three-axis spectrome- 
ters (TAS), the energy resolution is inversely 
proportional to the beam intensity. Ideally, one 
would like to have a method that allows one to 
disentangle the resolution from the intensity. 
Such a technique exists for neutron scattering 
and is based on the spin-echo effect, where one 
uses the “independent” spin degree of freedom 
of the neutron (instead of its momentum or 
wavelength) to achieve high resolution and high 
intensity (see the first figure). The neutron spin- 
echo technique was invented by Mezei in 1972 
(3) and was first realized at the high-flux reactor 
of the Institute Laue Langevin (ILL) in Gre- 
noble, France, in 1978. As the illustration shows, 
both a polarizer and a spin flipper control and 
guide the neutron spins into a first precession 
coil. Because the number of precessions depends 
upon the velocity (indicated by colored arrows) 
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Spin gymnastics. Measurement of the neutron spin polarization after interaction with a sample provides information about 
dynamical processes occurring in the investigated materials. 


of each neutron, the polarization is lost at the 
sample position. After reversal of their spin 
around the y axis (by means of a -flipper), the 
neutrons pass through a second precession coil 
and a spin analyzer, to be finally detected. 
Although the original polarization is fully recov- 
ered (the spin echo) for elastically scattered neu- 
trons, neutrons scattered inelastically by an exci- 
tation will result in a tilted polarization. The 
magnitude of this tilt is a measure of dynamical 
processes in the sample. 

The main applications of this technique have 
been very-high-resolution quasielastic measure- 
ments of mesoscopic systems in the fields of soft 
matter, biology, and glasses. It turned out that 
traditional spin-echo is, for various reasons, not 
well adapted for measurements of dispersion 
curves away from their minima. To overcome 
this limitation, it was suggested that traditional 
TAS could be combined with a modified version 
of the spin-echo technique. After this proposal, 
several devices were realized at the Hahn- 
Meitner-Institute in Berlin (4), ILL (5), and the 
Forschungsreaktor Miinchen (FRM) in Munich 
(6), which enabled high-resolution lifetime 
measurements of phonons in various materials. 

Bayrakci et al. present lifetime measure- 
ments of magnetic excitations obtained on the 
latest TAS—spin echo instrument built at the 
newly operational FRM-II reactor in Munich. 
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Even for the prototype antiferromagnetic MnF, 
compound studied by these authors, it is shown 
that the very high resolution (AE ~ 1073 meV) 
that has been achieved allows one to unravel 
important discrepancies with well-established 
theories in the field of magnetism, as shown in 
the second figure. In this illustration, the vertical 
axis represents the measured energy width of a 
magnon excitation as a function of temperature 
and momentum. This quantity is inversely pro- 
portional to the magnon lifetime. For intermedi- 
ate momenta and temperatures (green-yellow), 
the data can be explained by considering magnon- 
magnon scattering processes. On one hand, addi- 
tional and unexpected scattering processes occur 
in the low-temperature regime (red), suggesting 
that new relaxation channels need to be consid- 
ered. On the other hand, existing theoretical cal- 
culations of the magnon lifetime fail to repro- 
duce the measured lifetime in the high-tempera- 
ture and large-momentum regime (blue). 

This work of Bayrakci et al. demonstrates 
that the TAS—spin echo technique proposed sev- 
eral years ago is now mature and ready for appli- 
cation to more complex materials like high-tem- 
perature superconductors or colossal-magne- 
toresistive manganites, where a delicate inter- 
play exists among the magnetic, lattice, and elec- 
tronic degrees of freedom. In a sense, neutron 
scattering is now seeing a revolution similar to 
the recent one in the field of ARPES with the 
development of high-resolution detectors (AE ~ 
1 meV) and which has had a tremendous impact 
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Mapping magnetic interactions. Three different 
scattering regimes of magnons in the antiferromag- 
net MnF, as determined by Bayrakci et al. using the 
TRISP spin-echo spectrometer at FRM-II. 


on our understanding of electron interactions in 
strongly correlated materials [for reviews, see 
(7, 8)]. The unique capability to perform neutron 
scattering experiments with even higher resolu- 
tions over full dispersion relations opens com- 
pletely new opportunities to test existing theoret- 
ical models of, and explore new phenomena in, 
solid state physics. 

Finally, it is worth mentioning that the 
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TAS-spin echo device represents one of the 
many examples of recent technological advances 
realized at various neutron centers worldwide. 
This encompasses the development of new 
sources such as the third-generation spallation 
neutron sources in the United States, Japan, and 
hopefully soon in Europe, as well as the develop- 
ment of advanced neutron guide systems, polar- 
ization devices, new detectors, high-field facili- 
ties, and so on. These few examples illustrate that 
neutron scattering remains an active field and 
will continue to represent a unique and precious 
technique for the study of materials with novel 
electronic and magnetic properties. 
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Climate Change and Crop Yields: 
Beyond Cassandra 


David Schimel 


he effects of increasing atmospheric car- 

| bon dioxide (CO,) concentrations on cli- 
mate have been a source of worry for 
decades, but the positive effect of increasing 
CO, levels on crop growth has been a silver lin- 
ing in the climate change cloud. Hundreds of 
studies, some dating back decades, have shown 
that most major food crops respond positively to 
increasing CO, concentrations, because of the 
direct stimulatory effect of CO, on photosynthe- 
sis and the indirect effect of decreasing the 
water requirements of crops. The former effect 
should make crops more productive and the lat- 
ter more drought-tolerant. Yet a new analysis of 
recent and far more realistic studies based on the 
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free-air concentration enrichment (FACE) tech- 
nique, reported by Long ef al. on page 1918 of 
this issue (/), sounds a cautionary note. 

In FACE studies, a sophisticated, computer- 
controlled micrometeorological system en- 
riches the air above a large plot to a target level, 
but does not otherwise alter the microclimate 
and growing environment. There is a large net- 
work of FACE studies around the world (see 
the figure), but relatively few have used food 
crops; most have been in natural or seminatural 
ecosystems. Long et al. synthesized the results 
from agronomic FACE studies and find the 
effects of increased CO, concentrations on 
crop yields to be no more than half of the levels 
expected from earlier laboratory and open-top 
chamber studies. 

What is the significance of this result? In 
anticipating the impacts of global change on 


An analysis of recent data from a wide variety of 
field experiments suggests that previous studies 
overestimated the positive effects of higher 
carbon dioxide concentrations on crop yields. 


agriculture, crop models have been major 
tools. Modern crop models are widely used in 
agricultural research and planning and have 
been shown to be robust and accurate under a 
wide range of conditions in today’s world. 
These models have been extended to study cli- 
mate change scenarios, including simulated 
responses to climate and CO, and a wide range 
of assumptions about technological change 
and adaptation by farmers and markets. 

These modeling results suggest that, at 
least in some regions, the future of agriculture 
may be one of the brighter spots of climate 
change. Although guarded in their remarks, 
most assessments of climate change impacts 
on agriculture have concluded that the global 
impacts are relatively small (although they 
may be severe in some areas) (2). This is 
largely because the assumed effects of CO, on 
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The global network of FACE sites. For a full list of sites, including contacts, see http://cdiac.ornl.gov/programs/FACE/whereisface.html. 


crop growth and water use offset all or part of 
the negative impacts of warmer temperatures 
and rainfall changes (3). Depending on the 
region, this effect may ameliorate negative 
impacts of climate change or even allow for 
increases in crop yield. 

Long et al. find that most current models 
are based on literature suggesting 20 to 30% 
increases in photosynthesis and yield. This lit- 
erature also suggests substantial effects in 
maize. Maize, a C4 photosynthesis pathway 
crop, should have no direct response to CO, 
and should only respond via decreased water 
requirements (C4 photosynthesis uses a CO,- 
concentrating mechanism at the cellular level 
that is less sensitive to atmospheric CO, than 
the more common C3 pathway). 

The extant FACE results in food crops are 
different from earlier lab and chamber results 
in a consistent and disturbing way. In survey- 
ing the effects of CO, concentration in the 
more realistic FACE studies, Long et al. con- 
clude that the actual effect of CO, concentra- 
tion is about half of that in the experiments 
used to develop climate change models. 
Despite earlier reports of substantial effects in 
C4 crops, Long et al. found no increase in yield 
at all in this case, although there was some evi- 
dence for increased drought tolerance. 

If Long ef al. are right, then even the cur- 
rent guarded assessment of the impacts of cli- 
mate change on food crops may be optimistic. 
If an assumption of a nearly 30% increase in 
yield at 550 parts per million by volume 
(ppmv) of CO, is buried in current models and 


if the true response is half that, then the global 
pattern of modeled agricultural yields could be 
an overly rosy picture. Areas with modeled 
increases might see no change, and areas with 
no change or reductions might in reality expe- 
rience crop failure. 

In 2001, the Intergovernmental Panel on 
Climate Change assessment concluded that 
“experiments have shown that relative 
enhancement of productivity caused by ele- 
vated CO, usually is greater when temperature 
rises but may be less for crop yields at above- 
optimal temperatures [...] Although the bene- 
ficial effects of elevated CO, on the yield of 
crops are well established for the experimental 
conditions tested, this knowledge is incom- 
plete for numerous tropical crop species and 
for crops grown under suboptimal conditions.” 
[see p. 237 in (2)]. This conclusion may be 
overly optimistic, and concerns about food 
security, especially in regions with high tem- 
peratures, poor soils, and dry conditions, may 
need to be revisited. 

The report by Long et al. may move 
impacts on agriculture higher up on the list of 
pressing concerns about climate change, but 
this is not where the authors conclude their 
analysis. Rather, they note that the theoretical 
effect of increased CO, on yield is much larger 
(36% at twice the pre-industrial CO, levels) 
than the realized effects on yield. The observed 
effect on yield is typically closer to 20%, with 
effects on biomass and yield being even 
smaller (20 and 13%, respectively). 

This result calls for a cautious use of mod- 


els, but also suggests an opportunity. Some set 
of biological processes appears to operate to 
reduce the impact of CO, on realized gains in 
biomass and yield below that expected from 
the effects on photosynthesis. The processes 
that cause this reduction are poorly understood 
and are not included effectively in models to 
this day. Long et al. conclude that this is an 
opportunity for crop breeders to develop vari- 
eties that can take advantage of the increase in 
atmospheric CO, concentration. Crop breed- 
ers, especially when able to use genetic engi- 
neering techniques, have produced an extraor- 
dinary array of crops for a multitude of uses 
and environments. 

If successful, breeding for increased CO, 
concentration could become a major factor in 
the agricultural sector and could move agri- 
culture back off the list of the most pressing 
concerns about climate change. Although 
Long et al. begin Cassandra-like by raising a 
new concern, they conclude with a creative 
suggestion for adaptation to increased CO, 
levels using the proven creativity of agro- 
nomic science. 
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AAAS NEWS & NOTES 


INTERNATIONAL 


Pioneering AAAS Project Uses 
Satellites to Aid Human Rights 


On 14 May 2004, a satellite passing 450 km 
over Zimbabwe captured an image that 
included portions of the hardscrabble Hatcliffe 
settlement—more than 700 homes and other 
buildings scattered across the grasslands just 
north of the nation’s capital city, Harare. Less 
than 16 months later, on 2 September 2005, a 
satellite sent back a stunning new picture: The 
pattern of red-dirt roads was still visible, but the 
buildings were gone. 

The pictures were among the first collected 
by a commercial satellite company as part of a 
year-long AAAS pilot project to assess how 
satellites and other geospatial technology can be 
used in support of human rights. Already the 
project is having an impact: Amnesty Interna- 
tional and Zimbabwe Lawyers for Human 
Rights used satellite pictures of the destroyed 
Porta Farm settlement in a 31 May report on the 
Zimbabwe government’s destructive campaign 
to uproot opposition, generating extensive 
newspaper and broadcast coverage in Europe, 
Africa, and the United States. 

Otto Saki, an attorney with the Zimbabwe 
lawyers group, said in e-mailed remarks that the 
satellite images may have “a phenomenal 
impact” in legal action over the systematic 
destruction of villages under the government of 
President Robert Mugabe. 

“New satellite technology provides the 
unprecedented ability to document human 
rights abuses via a virtual ‘eye in the sky’,” said 
Larry Cox, executive director of Amnesty 
International USA. “With satellite projects like 
this one, we are gaining the ability to detect, 
publicize, and even prevent future human 
rights abuses from occurring in Zimbabwe and 
around the world.” 

Geospatial technology is not new—the 
development of hot air balloons and airplanes 
brought the use of aerial cameras; intelligence 
agencies have long used spy satellites; and 
scientists use such tools to study the weather 
and forest fires. But images from government 
satellites are not usually available in a timely 
way to human rights groups, and new images 
from privately owned satellites can cost 
$2000 or more. 

Last December, the AAAS Science and 
Human Rights program obtained a $110,000 


Portions of the Hatcliffe settlement outside of Harare, 
Zimbabwe, on 14 May 2004. 


The same part of Hatcliffe on 2 September 2005— 
with scores of buildings demolished. 


grant for a pilot project from the John D. and 
Catherine T. MacArthur Foundation. 

Imaging satellites and other geospatial 
technology have been “vastly underutilized” in 
human rights work, said Lars Bromley, who has 
guided the project as a senior program associate 
in the AAAS Office of International Initiatives. 
“By handling all the technical and analytical 
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aspects, AAAS allows groups like Amnesty and 
the lawyers to match their issue expertise with 
the power of the imagery. If we can smooth this 
relatively complicated process, the NGOs 
working to protect human rights around the 
world can see lots of benefits.” 

Among key partners in the effort are 
Amnesty International USA; the United 
Nations Special Advisor for the Prevention of 
Genocide; the Natural Resources Defense 
Council; the U.S. Holocaust Memorial 
Museum; the U.S. Campaign for Burma; and 
EQUITAS, the international center for human 
rights education. DigitalGlobe, a Colorado- 
based satellite image company, has been a 
partner and has provided images at a dis- 
counted price from its high-resolution Quick- 
Bird imaging satellites. Another satellite 
image company, GeoEye, also has provided 
generous support. 

Bromley and others say that sophisticated 
commercial satellites and the increasing power 
of personal computers and the Internet have 
made the data more available than ever. The 
costs are likely to fall in coming years as more 
commercial imaging satellites are launched. 

After AAAS finishes analyzing images from 
Hatcliffe, Porta Farm, and two other settlements, 
the project will turn to test cases in the Darfur 
area of Sudan and Burma. Published reports on 
the project’s interest in allegations of wholesale 
destruction in Burma’s Karen State have elicited 
a sharp rebuke in a newsletter controlled by the 
nation’s government. 

Over the years, AAAS’s Science and Human 
Rights program has pioneered a number of 
initiatives to develop and promote the use of 
scientific methods to advance human rights, 
including forensic sciences, statistics, and 
social science methods. If the geospatial pilot 
project is successful, AAAS and its partners 
will explore how to make it permanent. 


EDUCATION 


Senior Scientists and 
Engineers Bring 
Experience to Class 


Don Rea had been a research chemist and a 
research director at NASA and the Jet Propulsion 
Laboratory. He had never dissected a frog, 
though—but there he was, closely watched by 
a troop of middle-schoolers, gamely grabbing 
a scalpel. 
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“T had also never 
looked at protozoa 
before, but I was able 
to set up the micro- 
scopes and help the 
kids observe these,” 
Rea added proudly. 

Last September, 
Rea was one of nine 
volunteers who entered 
seven middle schools 
and a high school in 
Montgomery County, Maryland, to help students 
with their science lessons, support teachers 
with a little extra information, and even build a 
few experiments. Rea organized the volunteer 
program as a project of Senior Scientists and 
Engineers (SSE), a AAAS-affiliated group of 
retired researchers dedicated to public service. 

Rea and the SSE thought the schools might 
be a good place to fight back against science 
illiteracy and apathy. “The general populace is 
not well tuned in to science, and frequently takes 
positions that are contrary to the interest of good 
scientific development,” Rea said. 

“At a time when many political leaders—and 
many parents—recognize the need for a renewed 
national commitment to education in science, 
technology, and mathematics, this program is an 
inspiration,” said Alan I. Leshner, AAAS chief 
executive officer. “If retired professionals were 
able to form partnerships with schools nation- 
wide, this approach could help improve science 
literacy and give students further evidence that 
these are exciting, important fields.” 

After discussions with the Montgomery 
County schools, Senior Scientists and Engineers 
and AAAS signed an agreement with the schools 
for a 1-year pilot program. SSE then sent out a 
call for retired AAAS members who could help 
out in classes once a week. 

Rea said the idea was to broaden science’s 
appeal to the first generation of the 21st cen- 
tury. “We’re aren’t interested in providing 
tutoring to students who want to become sci- 
entists,” he explained. “Our objective is to try 
to increase science understanding of the 
entire class.” 

Teachers and students alike welcomed the 
newcomers. “It has been wonderful to have a 
person from the community who cares about 
science and knows a lot about scientific con- 
cepts to share his expertise with the students,” 
said Michelle Stanton, a Montgomery Village 
Middle School teacher. 


Don Rea 


AAAS 


AAAS Annual Election 


The slate of candidates for the 2006 election 
of AAAS officers will be announced in News 
and Notes in the 28 July issue of Science. 


Nina Hoffman, a teacher at Argyle Middle 
School, worked with volunteer Dave Weiss, a 
retired mechanical engineer. “His rapport with 
the students was wonderful,” she said. “When 
we’re doing labs, it’s great for the kids—now 
they have two adults to ask questions. And he just 
didn’t go and answer their questions. He asked 
questions in response to their questions to really 
make them think. They really do appreciate his 
knowledge and what he brings to the classroom.” 

Rea noticed that, too. “In one case, the 
teacher asked me to make an observation and I 
got a round of applause, which was totally 
unexpected,” he joked. 

Volunteers built a wave machine for one class 
and a metal ramp for a physics demonstration in 
Hoffman’s class, but Rea said that the volunteers 
focus mostly on supporting the teacher’s lessons 
rather than introducing new activities. 

The program won rave reviews in its first year, 
and both teachers and volunteers said that a year of 
experience would make them even more effective. 
It will be expanded to more Montgomery County 
middle schools this fall. SSE also is sharing 
ideas with similar programs around the country, 
Rea said. 

“There are over a million scientists and 
engineers over 60 around the country, and the 
number is probably growing every year,” Rea 
said. “That’s a big pool we can draw on to 
improve science teaching in schools.” 

For more information on Senior Scientists 
and Engineers, visit www.seniorscientist.org 
—Becky Ham 
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New AAAS Dues Rates 
Approved for 2007 


The AAAS Board of Directors has approved a 
dues increase for 2007. The Board authorizes 
increases to cover two kinds of expenses: 
unavoidable costs associated with running AAAS 
and publishing Science, and new expenses that 
add value to membership. Postage and paper 
increases and improving online resources are 
examples of the kind of expenses that the Board 
anticipated in setting the 2007 rates. 

The new rates are effective for membership 
terms beginning after 31 December 2006. As 
listed below, they do not include postage or taxes 
for international members, which is additional. 


¢ Regular professional members $142 

¢ Postdocs and K-12 teachers $99 

¢ Emeritus members who receive $110 
print Science 

* Students $75 

+ Patrons $310 

¢ Supporting and Emeritus members $56* 


who do not receive Science 
The Board also set the institutional sub- 
scription rate for print Science at $360 for high 
school and public libraries and $710 for all other 


COMMUNICATION 


Screeners Needed 
for AAAS Science 
Journalism Awards 


Scientist volunteers are needed to review entries 
in the prestigious AAAS Science Journalism 
Awards program. Scientists residing in the 
Washington, D.C., area, or who will be in the 
area in mid-August to mid-September, are 
invited to help screen print, radio, and television 
reports for scientific accuracy. If interested, 
please contact Lonnie Shekhtman (202-326- 
6434; |shekhtm@aaas.org) in the AAAS Office 
of Public Programs. 

Winners of the awards, which are spon- 
sored by Johnson & Johnson Pharmaceutical 
Research & Development, L.L.C., will be 
honored at a ceremony in February 2007 at 
the AAAS Annual Meeting in San Francisco. 
Members of the screening committees will be 
recognized in the awards booklet distributed 
during the ceremony. 


institutions. For further information, including 
subscription rates for Science Online, librarians 
should contact AAAS or their subscription agents, 
or go to www.sciencemag.org/subscriptions/ 
inst_sub.dtl on the Web. 

All members will be advised of the new dues 
rates on their renewal notices for 2007. Member 
dues and voluntary contributions form the 
critical financial base for a wide range of AAAS 
activities. For more information, contact the 
AAAS Membership Office at 202-326-6417, 
www.aaas.org/membership/. 

* Supporting member dues rate is set by the 
membership department 


SCIENCE VALUES 


NAS, AAAS Launch 
Integrity Web Site 


In a move to provide practical resources on 
ethics and integrity, AAAS and the National 
Academy of Sciences have joined to offer an 
online bibliography on scientific integrity. 

The collaboration will bring an array of 
articles, essays, and other materials published 
by NAS and AAAS to young scholars, working 
scientists, and educators. The bibliography is 
based on the idea that trust and accountability 
are integral to the research enterprise and the 
sharing of scientific information. 

Visit the bibliography at www.aaas.org/ 
integrity. 
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Bacterial Diversity in Tree Canopies 
of the Atlantic Forest 


M. R. Lambais,”* D. E. Crowley,* J. C. Cury,? R. C. Biill,? R. R. Rodrigues” 


losphere, is one of the most common 
habitats for terrestrial microorganisms (/), 
but almost nothing is known about the diversity 
of microorganisms that inhabit this environment 
(2). Here, we report a survey of bacterial di- 
versity in the leaf canopy of a tropical Atlantic 
forest. The Atlantic Forest of Brazil is a bio- 
diversity hotspot that has been reduced to less 
than 8% of its original size over the past 4 
centuries and is considered to be the oldest 
forest on the planet, containing about 20,000 
vascular plant species, of which about one-half 
are endemic (3, 4). Initially we compared the 
bacterial communities on the leaf surfaces of 
nine tree species (table S1) by using a molecular 
method that generates a DNA fingerprint of the 
predominant bacteria from their 16S ribosomal 
RNA (rRNA) gene sequences (5) (fig. S1). Our 
results showed that bacterial communities from 
the same tree species varied but could be 
consistently grouped by discriminant analyses 
(table S2). These data are consistent with pre- 
vious research showing that different plants 
select for distinct microbial communities (6). 
To identify the bacteria in the phyllospheres of 
Trichilia catigua, T. clausenii, and Campoma- 
nesia xanthocarpa, we analyzed 418 partial 
DNA sequences encoding 16S rRNA genes (5). 
Comparison of homologous and heterologous 


T= leaf surface, also known as the phyl- 
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coverage curves indicated that all three phyllo- 
sphere communities were significantly different 
in their bacterial species compositions (fig. $2). 
For all clone libraries, the sample size was suf- 
ficient to recover the most abundant deep phylo- 
genetic groups. At evolutionary distances (D) 
higher than 0.20 (the cutoff value for group 
sequences at the phylum level), the homologous 
coverages were greater than 96% (fig. S2). At a 
D of 0.03, corresponding to bacterial species, 
coverages varied from 67 to 81% (fig S2). 
Species richness was estimated by using Chaol 
nonparametric estimator (table $3). Each phyllo- 
sphere community harbored from at least 95 to 
671 bacterial species (Fig. 1 and table S3), of 
which only 0.5% were common to all of the trees 
studied. Almost all of the bacteria (97%) were 
from undescribed species, suggesting they may 
be unique to the phyllosphere habitat (table S4). 

Although this initial survey was limited in 
scope, extrapolation of our results for the 20,000 
vascular plant species in the Atlantic Forest would 
yield about 2 to 13 million new bacterial species. 
The absolute diversity of bacteria in nature is 
unknown, but by comparison the Earth’s oceans 
have been estimated to contain up to 2 million 
species, whereas a ton of soil may have 4 million 
species (7). The estimates for phyllosphere 
diversity could be decreased considerably should 
future surveys reveal higher amounts of overlap 


T. catigua 
T. clausenii 
C. xanthocarpa 


0.6 0.8 1.0 


Evolutionary distance 


Fig. 1. Estimated number of phylotypes at different evolutionary distances, using Chaol nonparametric 
estimator, on the leaf surface of different tree species. Bars represent 95% confidence intervals. 


in bacterial community composition between 
tree species. On the other hand, the bacterial 
species richness for the individual trees surveyed 
represent minimum estimates of that which may 
occur on individual trees. Variations in commu- 
nity structures within tree species may possibly 
correspond to different leaf ages, location in the 
canopy, light incidence, and microclimate con- 
ditions that influence the leaf environment (8). 
The current study provides a glimpse into the 
microbial diversity in tree canopies of tropical 
forests, and there are many questions that arise 
from this research. Do the same tree species in 
completely different locations or continents har- 
bor similar communities? To what degree do 
various environmental factors affect the compo- 
sition and structure of phyllosphere commu- 
nities? What is the diversity of fungi and archaea 
on the plant leaf surfaces, and what role does 
phyllosphere microbial community play in pro- 
tection against herbivory or infection by patho- 
gens? As we begin to survey the bacterial 
species through systematic surveys of different 
plants, there will be exciting opportunities for 
studies of the metabolic capabilities and the 
ecological functions of phyllosphere microorga- 
nisms in terrestrial ecosystems. 
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Food for Thought: Lower-Than- 
Expected Crop Yield Stimulation 
with Rising CO, Concentrations 


Stephen P. Long,””* Elizabeth A. Ainsworth,**? Andrew D. B. Leakey,” 
Josef Ndsberger,” Donald R. Ort**?? 


Model projections suggest that although increased temperature and decreased soil moisture will act 
to reduce global crop yields by 2050, the direct fertilization effect of rising carbon dioxide 
concentration ([CO,]) will offset these losses. The CO, fertilization factors used in models to project 
future yields were derived from enclosure studies conducted approximately 20 years ago. Free-air 
concentration enrichment (FACE) technology has now facilitated large-scale trials of the major 
grain crops at elevated [CO,] under fully open-air field conditions. In those trials, elevated [CO,] 
enhanced yield by ~50% less than in enclosure studies. This casts serious doubt on projections 


that rising [CO,] will fully offset losses due to climate change. 


uch effort has been put into linking 
Me of climate and crop growth to 
project future changes in crop yields 
and food supply across the globe (/—4). Pro- 
jections reviewed by the Intergovernmental 
Panel on Climate Change (IPCC) suggest that 
increased temperature and decreased soil mois- 
ture, which would otherwise reduce crop yields, 
will be offset by the direct fertilization effect 
of rising carbon dioxide concentration ([CO,]) 
(5-7). The IPCC projections suggest that total 
crop yield may rise when averaged across the 
globe, but this net gain will result from generally 
lower yields in the tropics and increased yields in 
temperate zones. The accuracy of these projec- 
tions and thus future food security depend crit- 
ically on the magnitude of the CO, fertilization 
effect under actual growing conditions. 
Atmospheric [CO,] has risen from ~260 
parts per million (ppm) approximately 150 years 
ago to 380 ppm today (8). Yet [CO,] is marked- 
ly uniform across the globe; so, in contrast to 
temperature and soil moisture, there is no con- 
sistent spatial variation on which to estimate 
yield responses to increasing [CO,]. Similarly, it 
is not easy to alter [CO,] experimentally around 
a crop in the field. As a result, most information 
about crop responses to elevated [CO,] is ob- 
tained from studies in greenhouses, laboratory 
controlled-environment chambers, and transpar- 
ent field chambers, where released CO, may be 
retained and easily controlled. These settings 
have provided the basis for projecting CO, fer- 


Department of Plant Biology, “Department of Crop Sci- 
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tilization effects on the major food crops: maize, 
rice, sorghum, soybeans, and wheat. 

Crops sense and respond directly to rising 
[CO,] through photosynthesis and stomatal 
conductance, and this is the basis for the fer- 
tilization effect on yield (9). In C, plants, meso- 
phyll cells containing ribulose-1,5-bisphosphate 
carboxylase-oxygenase (RuBisCO) are in direct 
contact with the intercellular air space that is 
connected to the atmosphere via stomatal pores 
in the epidermis. Hence, in C, crops, rising 
CO, increases net photosynthetic CO, uptake 
because RuBisCO is not CO,-saturated in 
today’s atmosphere and because CO, inhibits 
the competing oxygenation reaction leading to 
photorespiration. RuBisCO is highly conserved 
across terrestrial plants, so instantaneous re- 


sponses to increased [CO,] may be generalized 
across C, plants, including rice, soybeans, and 
wheat. In theory, at 25°C, an increase in [CO,] 
from the present-day value of 380 ppm to that 
of 550 ppm, projected for the year 2050, would 
increase C, photosynthesis by 38% (9). In con- 
trast, in C, crops such as maize and sorghum, 
RuBisCO is localized to bundle sheath cells in 
which CO, is concentrated to three to six times 
atmospheric [CO,] (/0). This concentration is 
sufficient to saturate RuBisCO and in theory 
would prevent any increase in CO, uptake with 
rising [CO,]. Although C, crops may not show 
a direct response in photosynthetic activity, an 
indirect increase in the efficiency of water use 
via reduction in stomatal conductance may still 
increase yield (9). 

How have CO, fertilization factors been 
derived? Most models used to predict future 
crop yields, including those within the IPCC 
(5), are from two families: the Decision Sup- 
port System for Agrotechnology Transfer 
(DSSAT) (6, 77, 12) and the Erosion Produc- 
tivity Impact Calculator (EPIC) (/3—/5). Studies 
using DSSAT assume CO, fertilization factors 
based on the method of Peart et al. (3), which 
used summaries for soybeans (/6), maize (17), 
wheat (/8), and rice (/8). Studies using EPIC 
(13-15) assume CO, fertilization factors based 
on the method of Stockle et al. (4), which 
parameterized a CO, response function to 
reproduce the mean yield stimulations reported 
for elevated [CO,] by Kimball (/8). Tracing 
DSSAT and EPIC methods back reveals that the 
magnitude of the CO, fertilization effects in 
these models is primarily based on data from 
three literature reviews from the 1980s (/6—/8). 
The CO, fertilization effects reported in these 
reviews for the major crops are given in Table 1 


Table 1. Percentage increases in yield, biomass, and photosynthesis of crops grown at elevated 
[CO,] (550 mol mol~+) relative to ambient [CO] in enclosure studies versus FACE experiments. 
Data for enclosure studies were summarized by Kimball (28), Cure and Acock (17), and Allen et al. 
(16) and in Fig. 2. Mean response ratios from these reviews were adjusted to an elevated [CO,] of 
550 pmol mol~? by means of the nonrectangular hyperbolic functions for C, and C, species from 
Fig. 2. The values that summarize all chamber studies shown in Fig. 2 are given in the row entitled 
“enclosure studies.” Percentage increases for FACE studies were generated by meta-analysis [see 
supporting online material (SOM) and table $2] (37). 


Source Rice Wheat Soybeans C, crops 
Yield 
Kimball (1983) 19 28 21 - 
Cure and Acock (1986) 11 19 22 27 
Allen et al. (1987) - - 26 - 
Enclosure studies - 31 32 18 
FACE studies 12 13 14 o* 
Biomass 
Cure and Acock (1986) 21 24 30 8 
Allen et al. (1987) - - 35 - 
FACE studies 13 10 25 o* 
Photosynthesis 
Cure and Acock (1986) 35 21 32 4 
FACE studies 9 13 19 6 


*Data from only 1 year in Leakey et al. (30). 
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after adjustment to estimate crop performance at 
a common [CO,] of 550 ppm. Collectively, the 
fertilization factors averaged across the C, crops 
(rice, wheat, and soybeans) are 24% for yield, 
27% for biomass, and 29% for photosynthesis. 
The responses for maize were lower except for 
yield, which was reported to increase by 27% 
(Table 1). All studies included in the reviews 
used enclosures, such as controlled environmen- 
tal chambers, transparent field enclosures, or 
open-top chambers. Since the 1980s, many fur- 
ther chamber studies have been conducted. 
When these are compiled for wheat and soy- 
beans, an even larger yield fertilization factor of 
31% is suggested (Table 1). Although this is a 
wealth of data on which to project a CO, fertil- 
ization effect for crops across the globe, no 
agrochemical or plant-breeding company would 
base its business plan for a new chemical or 
variety solely on greenhouse studies without 
rigorous field trials (J9, 20). Yet our current 
projections of future world food supply are 
based on such potentially inadequate data. 

Why might chamber studies be inadequate 
for predicting future yields? Many chamber 
studies used plants grown in pots, which are 
now known to alter the response of plants to 
elevated [CO,] (2/). Most of the field studies 
used open-topped and transparent-walled 
chambers, up to 2 m in diameter. Despite being 
partially open to the atmosphere, important en- 
vironmental differences remain. In a chamber 
carefully designed to minimize environmental 
differences, receiving ~75% of full sunlight, 
the temperature inside the chamber was 4.3°C 
warmer and the water vapor pressure deficit 
was 0.8 kPa higher (22) than outside the 
chamber. The transmission of sunlight into the 
chambers was lower and the ratio of diffuse to 
direct sunlight increased. Other chamber types 
would cause even greater perturbation of the 
natural environment. All chambers alter air 
flow and intercept rainfall. Access by pests 
and diseases is restricted, but if they gain ac- 


Fig. 1. (A) One of the 
16 FACE plots of soy- 
beans at the University 
of Illinois SoyFACE facil- 
ity. CO, is released into 
the wind from nozzles in 
the green pipe, on the 
upwind side of the plot. 
Release rate is determined 
by wind speed and [CO,], 
which is measured at the 
center of each ring. (B) 
The false-color infrared 
image provides a simple 
visualization of the uni- 


formity of CO, treatment within a FACE plot. Here the atmosphere around a 
maize crop within the octagonal plot is maintained at 550 ppm [CO.], whereas 
the rest of the field is at the current ambient [CO,] of ~380 ppm. Maize 
growing inside an elevated [CO,] plot was warmer in full sunlight than maize 
growing under ambient [CO,] outside the plot at 15:30 on 15 July 2004. At 
that time, the average canopy temperature inside the four elevated [CO,] plots 


cess, higher humidity and more shelter may 
accentuate epidemics. As a result, the effect of 
the chamber on plants is often greater than that 
of elevated [CO,] (23). In agronomic trials, 
buffer rows are used between treatments; typ- 
ically the width of this zone is twice the height 
of the crop. Because of the small practical size 
of chambers, most or all of the treated crop will 
be within this zone, which could exaggerate the 
response to elevated [CO,] (23). To overcome 
these limitations, free-air concentration enrich- 
ment (FACE) was developed. 

How does FACE work? A typical FACE ap- 
paratus consists of a 20-m-diameter plot within 
the crop field (Fig. 1A), in which CO, is re- 
leased just above the crop surface on the upwind 
side of the plot. Wind direction, wind velocity, 
and [CO,] (or ozone concentration) are mea- 
sured at the center of the plot. Fast-feedback 
computer control then adjusts the positions and 
amount of CO, released at different points 
around the plot. These systems have been en- 
gineered so that they can operate continuously 
from sowing to harvest and maintain [CO,] 
within the plot to within +10% of the target 
level, either 550 or 600 ppm, for ~90% of the 
time (9, 24-26) (Fig. 1B). Elevated [CO,] de- 
creases transpiration and therefore evaporative 
cooling, so that in sunlight the crop is warmer. 
This can serve to illustrate the uniformity of 
treatment (Fig. 1B). 

Mini-FACE systems as small as | m in di- 
ameter have been developed and have proved 
invaluable in ecosystem studies where the focus 
is on the effect of increased input of carbon 
(27), but they do not escape the problems of 
enclosures with respect to scale. Avoiding edge 
effects associated with small plots is critical 
when the objective is to determine an exact 
CO, fertilization factor for crops. Our analysis 
has therefore been limited to full-size FACE 
systems of plots >8 m in diameter, investigating 
the five major global food crops and managed 
pasture systems (table S1): wheat at Maricopa, 
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Arizona, USA, in 1992, 1993, 1996, and 1997; 
managed grassland at Eschikon, Switzerland, 
from 1993 to 2002; managed pasture at Bulls, 
New Zealand, from 1997 to 2002; sorghum at 
Maricopa, Arizona, USA, in 1998 and 1999; 
rice at Shizukuishi, Japan, from 1998 to 2000; 
and soybeans at Urbana-Champaign, Illinois, 
USA, from 2001 to 2005 and maize at the same 
location in 2002 and 2004 (26, 28). 

What have we learned from the FACE ex- 
periments? The response of plant production to 
[CO,] is approximately hyperbolic, increasing 
linearly at subambient concentration and satu- 
rating at around 800 to 2000 ppm. The ratio of 
yield at treatment [CO,] to yield at atmospheric 
[CO,] was calculated for over 340 independent 
chamber studies. Hyperbolas of the response 
of yield to [CO,] were then fit for wheat, soy- 
beans, and C, grains (maize and sorghum 
combined) (Fig. 2). Only one replicated FACE 
experiment was conducted with each of these 
crops, but these experiments were repeated over 
2 to 5 years. It was notable that for each crop, 
the stimulation of yield observed in FACE 
experiments fell well below (about half) the 
value predicted from chambers (Fig. 2). This 
was apparent for total biomass and most marked 
for photosynthesis. Notably, the stimulation of 
photosynthesis by elevated [CO,] in enclosure 
studies of rice was four times the value observed 
in the rice FACE experiment (Table 1). With so 
few FACE studies, it might be thought that 
these lower values are the result of chance. 
Table 1 shows that for three key production 
measures in four crops, only 1 of the 12 items is 
not lower than the chamber equivalent. The 
probability of this outcome being attributed to 
chance is remote (P = 0.003). 

Results from FACE experiments with C, 
crops are consistent with CO, having no direct 
effect on photosynthesis, but there may be 
an indirect effect through the amelioration of 
drought stress by reduced stomatal conductance 
at elevated [CO,] (29-31). This fits the theoret- 


Elevated [CO2] 27.5°C 


Ambient [CO,] 26.1 °C 


at SoyFACE was 27.9 + 0.2°C, significantly higher than canopy temperatures 
under ambient [CO,] outside the plots (26.8 + 0.3°C; P = 0.03). Because the 
pipes surrounding the plot are dry, they are warmer and so appear as white or 
light yellow. Greater canopy temperatures under elevated [CO] result from 
lower stomatal conductance, reducing latent heat loss by evapotranspiration 
and leading to lower crop water use, as described in Leakey et al. (30). 
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ical expectation that C, photosynthesis is CO,- 
saturated at current atmospheric [CO,] (/0); 
therefore, no yield increase would be expected 
for well-watered crops. Under drought, elevated 
[CO,] increased midday photosynthesis by 23% 
in sorghum (3/). This failed to translate into a 
significant yield increase (32). On average, no 
significant yield increase has been observed for 
C, crops or C, wild grasses at elevated [CO,] 
in FACE studies (28). This is in sharp contrast 
to the large stimulation of yield for well- 
watered plants in chambers (Fig. 2B) used to 
parameterize models. This suggests that the 
consistent stimulation of C, crop yield by ele- 
vated [CO,] currently applied in models is 
inappropriate. At best, yield will in all prob- 
ability be enhanced by elevated [CO,] only in 
times and places of drought. 

Wheat and rice FACE experiments included 
nitrogen treatments. At the lowest [N] (15 to 70 
kg of N ha™!), the average yield increase with 
elevated [CO,] was only 9% (28), just over 
one-third of that of the chamber response 
(Table 1). Although this N input treatment 
was considered low by the standards of 
intensive agriculture in the European Union 
and United States, these levels exceed the 
world average and may therefore be closer to 
the stimulation factor for crop yields across the 
globe. Lower-than-expected yields under ele- 
vated [CO,] are not just confined to grain 
crops. For example, the major C, herbage 
grass, Lolium perenne, also showed a yield 
increase of only 9% at two locations; and at the 
lowest [N] (100 to 140 kg of N ha~'), the yield 
increase was an insignificant 1% (table S2) 
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(28). Although the data here apply to a single 
species, L. perenne is one of the most important 
and widely grown herbage grasses in the 
temperate zone. 

No FACE experiment has been conducted 
in the tropics, but two factors emerging from 
temperate studies have particular implications 
for tropical crops. First, the CO, fertilization 
effect may be small without large additions of 
N. Second, FACE experiments with the major 
grain crops of sub-Saharan Africa, sorghum and 
maize, have so far failed to show any yield in- 
crease from elevated [CO,]. Parry et al. (7) 
projected that yield losses in these countries due 
to climate change could be 10 to 30% by 2050, 
but these would be ameliorated to only 2.5 to 
5% when the CO, fertilization effect is added 
(7). The FACE experiments suggest that this 
amelioration may be far less than expected. 

Rising surface ozone. Increased combustion 
of fuels will increase not only atmospheric 
[CO,] but also atmospheric nitrogen oxide con- 
centrations, which, when coupled with climate 
change, will result in a continued increase in 
surface ozone concentration ([O,]). Many rural 
areas in the temperate zone of the Northern 
Hemisphere, as well as in the tropics, are forecast 
to see increases in [O,] of ~20% by midcentury 
(8). Ozone is toxic to plants at concentrations as 
low as 30 parts per billion (ppb). Although 
chamber studies have shown large yield losses 
owing to elevated [O,] (33), these effects are 
not incorporated in current projections of future 
yields (2, 8). 

Until very recently, the only studies of the 
effects of elevated [O,] on crops were conducted 
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Fig. 2. Effects of elevated [CO,] on crop yield. Data are yields at elevated [CO,] relative to those at 
ambient [CO,] (arrow) for (A) soybeans in chambers (solid blue circles) and FACE (blue square, hidden 
behind red square) and wheat in chambers (red circles) and FACE (red square); and (B) C, crops (maize 
and sorghum combined) in chambers (green circles) and FACE studies (green square). Error bars 
indicate mean + 90% confidence intervals around the means for the FACE studies. The chamber studies 
included 115 independent measures of soybeans (21), 211 of wheat (36), and 14 of maize and 
sorghum (table $3). These measures were divided into 10 classes of growth [CO,] in 100-ppm 
increments. Plotted values are the class means of growth [CO,] and yield. Solid lines are the least- 
squares fits for the nonrectangular hyperbolic response of yield to growth [CO,] from these enclosure 
studies of soybeans (blue line, r2 = 0.98), wheat (red line, r2 = 0.88), and C, crops (green line, 1? = 
0.99). The yield response of soybeans in chambers to growth [CO,] of 900 to 999 ppm [open blue circle 
in (A)] was an outlier and was excluded from the curve fitting. Full details of the meta-analysis methods 
and results from FACE are presented in the SOM and table S2. 


in chambers, and it was unclear whether simi- 
lar losses would occur under conditions of nor- 
mal canopy/atmosphere coupling in the field. 
Morgan et al. (34) used a FACE system adapted 
to elevate [O,] rather than [CO,] to examine 
whether the decreases in yield for soybeans in 
central [Illinois projected from chamber experi- 
ments occurred in the open air. A 23% increase 
in [O,] from an average daytime ambient con- 
centration of 56 to 69 ppb over two growing 
seasons decreased soybean yield by 20%. How 
does this compare with the expectations estab- 
lished from chamber studies? Based on a prior 
compilation of chamber studies (33), the ex- 
pected decrease was 8%. If the effects of [CO,] 
and [O,] observed in FACE studies are additive, 
then the net effect of simultaneous increases in 
[O,] and [CO,], as forecast by the IPCC AIB 
scenarios, would be a 5% decrease in yield, 
compared with the 23% increase used to pa- 
rameterize current models (Table 1). Chamber 
studies suggest that elevated [CO,] may provide 
some protection against elevated [O,] and there- 
fore the effects will not be additive, but this has 
yet to be verified for any crop under open-air 
field conditions. 

What is needed? The CO, fertilization ef- 
fects, derived from chamber experiments, cur- 
rently used in crop models forecast substantial 
increases in future crop production under con- 
ditions associated with climate change. The 
FACE experiments, conducted in open fields, 
are not without their limitations (26, 35), but rep- 
resent our best simulations of the future elevated 
[CO,] environment. Our meta-analytic summary 
of the FACE experiments indicates that there 
will be a much smaller CO, fertilization effect 
on yield than currently assumed, and possibly 
little or no stimulation for C, crops. 

The average yield increase at elevated [CO,] 
for crops in FACE studies fell well short of the 


% Increase at elevated [CO,] 


Theory A’ Biomass Yield 


Fig. 3. Comparison of theoretical and actual 
changes in C, crop production parameters at an 
elevated [CO,] of 550 ppm relative to ambient 
[CO,]. Theory, theoretical RuBisCO-limited photo- 
synthesis at 550 ppm [(9) and SOM]; A’, mea- 
sured daily integral of carbon uptake; biomass, 
final above-ground biomass; yield, harvestable 
grain yield. Error bars indicate mean + 90% 
confidence intervals. A’, biomass, and yield were 
measured in C, crops exposed to elevated [CO,] 
in FACE experiments (table $2). 
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theoretically possible increase based on the 
well-defined properties of RuBisCO (Fig. 3). 
At 25°C, an increase in [CO,] to 550 ppm 
should increase light-saturated photosynthesis 
by 36%. The average increase observed for C, 
crops in FACE was 20% for the daily integral 
of photosynthetic CO, uptake, 17% for total 
biomass, and just 13% for yield (Fig. 3). This 
suggests that a series of feedbacks operate in 
the field to constrain realization of the potential 
benefits of elevated [CO,]. Only with a 
thorough high-priority R&D effort might we 
overcome these feedbacks and achieve the 
potential gains in food supply. 

The FACE experiments clearly show that 
much lower CO, fertilization factors should be 
used in model projections of future yields; 
however, the present experiments are limited in 
the range of growing conditions that they cover. 
Scientists have not investigated the interactive 
effects of simultaneous change in [CO,], [O,], 
temperature, and soil moisture. Technological 
advances suggest that large-scale open-air 
facilities to investigate these interactions over 
controlled gradients of variation are now pos- 
sible (26). Although we have projected results 
to 2050, this may be too far in the future to spur 
commercial R&D, but it must not be seen as 
too distant to discourage R&D in the public 
sector, given the long lead times that may be 
needed to avoid global food shortage. 
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Frictional Afterslip Following the 2005 
Nias-Simeulue Earthquake, Sumatra 


Ya-Ju Hsu,"* Mark Simons,” Jean-Philippe Avouac,* John Galetzka,’ Kerry Sieh,* 
Mohamed Chlieh,? Danny Natawidjaja,” Linette Prawirodirdjo,? Yehuda Bock? 


Continuously recording Global Positioning System stations near the 28 March 2005 rupture of the 
Sunda megathrust [moment magnitude (M,,) 8.7] show that the earthquake triggered aseismic 
frictional afterslip on the subduction megathrust, with a major fraction of this slip in the up-dip 
direction from the main rupture. Eleven months after the main shock, afterslip continues at rates 
several times the average interseismic rate, resulting in deformation equivalent to at least a 

M,, 8.2 earthquake. In general, along-strike variations in frictional behavior appear to persist over 
multiple earthquake cycles. Aftershocks cluster along the boundary between the region of coseismic 
slip and the up-dip creeping zone. We observe that the cumulative number of aftershocks increases 
linearly with postseismic displacements; this finding suggests that the temporal evolution of 


aftershocks is governed by afterslip. 


lip on faults occurs as a combination of 
relatively continuous aseismic creep and 
transient slip events. These transient events 
occur as earthquakes radiating seismic waves, 
and also as aseismic events with characteristic 
time scales of days to years. A better understand- 
ing of the physical factors that control the 


relative amounts and location of seismic and 
aseismic slip is a key goal in the study of fault 
mechanics and in particular can affect assess- 
ments of regional seismic and tsunami hazards. 
After a large earthquake, postseismic deforma- 
tion may result from earthquake-induced slip 
along the plate interface, commonly referred to 


as afterslip, and as viscoelastic relaxation in the 
volume surrounding the fault rupture (/—3). 
Thus, well-positioned postseismic observations 
can probe the mechanical properties of sub- 
duction megathrusts and the media that sur- 
round them. 

Geodetic and seismological investigations 
suggest that typical subduction megathrust 
earthquakes involve fault rupture at depths be- 
tween ~10 km and ~50 km, and that rupture all 
the way up to the trench is rare (4). However, 
evidence for slip on the shallowest portions of a 
megathrust has been notoriously difficult to 
evaluate. We commonly assume that seismic 
slip decreases in both up-dip and down-dip 
directions, presumably bounded by regions 
where frictional behavior of the fault does not 
support stick-slip (i.e., seismic) rupture (5). 
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These variations with depth are thought to 
result from the direct effect of temperature on 
the rheology of the plate interface or from 
indirect effects associated with metamorphism 
(6-8). In particular, the frictional behavior of 
the shallowest part of the megathrust is com- 
monly assumed to be governed by clay min- 
erals that promote rate-strengthening friction 
(6). Down-dip of the seismogenic zone, the 
megathrust presumably creeps continuously at 
approximately the plate convergence rate. In 
contrast to down-dip variations in seismogenic 
behavior, the underlying causes for along-strike 
variations remain enigmatic (9—/3). Further- 
more, what is actually happening along the 
shallowest portion of the megathrust is poorly 
known because of a lack of proximal geodetic 
observations in most areas. In contrast to most 
seismically active subduction zones, the pres- 
ence of islands relatively close to the trench and 
above the seismogenic portions of the Sunda 
megathrust makes it possible to constrain co- 
seismic, postseismic, and interseismic processes 
from geodetic monitoring and paleogeodetic 
studies (Fig. 1). 

The 28 March 2005 M,, 8.7 Nias-Simeulue 
earthquake resulted from rupture of the sub- 
duction megathrust off the shore of northwest- 
ern Sumatra, southeast of the 26 December 
2004 M,, 9.1 Aceh-Andaman rupture (/4, 15). 
In this region, the Indo-Australian plate con- 
verges obliquely toward the Sunda Block at a 
rate of ~57 mm/year (/6) (Fig. 1, inset). This 
convergence is approximately partitioned into a 
40 mm/year trench-normal component on the 
megathrust and a 25 mm/year component of 
dextral slip along the Sumatran Fault (77, 18). 
The subduction megathrust off the shore of 
Sumatra has produced four earthquakes with 
magnitudes larger than 8 since 1797, including 
aM ~8.5 event near Nias in 1861 (19, 20) that 
occurred in the approximate region of the 
southern asperity of the 2005 event. Paleogeo- 
detic and recent continuous Global Positioning 
System (cGPS) data from the Sumatran GPS 
Array (SuGAr) (2/) as well as survey-mode 
GPS data suggest that the shallow portion of 
the megathrust up-dip of the Batu Islands 
(Fig. 1) is creeping during the interseismic pe- 
riod (19, 22). The 2005 rupture occurred beneath 
the northern portion of SuGAr, permitting a 
record of both coseismic and postseismic 
deformation. 

The coseismic and postseismic slip model. 
We use cGPS data spanning the first 11 months 
after the 2005 Nias-Simeulue earthquake, in- 
cluding nine SuGAr sites and the station SAMP, 
installed by the Indonesian National Surveying 
Agency (BAKOSURTANAL) (Fig. 1) (5, 23). 
Three sites (LEWK, BSIM, and LHWA) were 
installed about a month before the Nias- 
Simeulue earthquake; three sites (PBLI, BITI, 
and BTHL) were installed 5 months after the 
earthquake; and three sites (PSMK, PTLO, and 
PBAJ) just south of the Equator were installed 


in mid-2002, well before the Aceh-Andaman 
earthquake. In addition, 102 field measure- 
ments of vertical displacements from coral 
microatolls constrain the coseismic slip model. 


Fig. 1. Surface dis- 
placement estimated 
at the location of the 
cGPS stations in north- 
ern Sumatra for three 
time periods: The 
interseismic period be- 
fore the 2004 Aceh- 
Andaman earthquake 
(green arrows), the 
2005 Nias-Simeulue 
earthquake coseismic 
period (red arrows), 
and the 11-month 
period after the 2005 
earthquake (black 
arrows). White and 
red stars indicate epi- 
centers of 2004 Aceh- 
Andaman and 2005 
Nias-Simeulue earth- 
quakes, respectively. 
The black barbed line 
denotes the Sunda 


In the modeling of postseismic deformation, we 
use only the cGPS data. 

The models considered here assume that all 
surface deformation is caused by slip on a dip- 
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megathrust. Interseismic displacements for LEWK, BSIM, and LHWA are not derived from cGPS data, but 
rather from a model constrained by campaign GPS data (46). Estimates of coseismic displacement for PTLO 
and PBAI are too small to be visible at this scale. Three sites (PBLI, BITI, and BTHL) were installed after the 
earthquake and do not have estimates of coseismic displacements. (Inset) Regional geography with motion 
of the Australian plate relative to the Sunda Block indicated by blue arrows. 


Fig. 2. Compilation 
of inferred coseismic 
and postseismic slip, 
illustrating extensive 
afterslip up-dip from 
the main shock and 
a lack of substantial 
overlap between seis- 
mogenic and aseismic 
regions. Distribution 
of coseismic slip is in- 
dicated by white con- 
tours at intervals of 
2 m; color indicates 
cumulative postseis- 
mic slip during the 9 
months after the main 
shock. Black and red 
vectors indicate ob- 
served and predicted 
GPS observations, re- 
spectively. Displace- 
ments measured at 
the stations deployed 
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5 months after the main shock are not shown. White and red stars are epicenters of 2004 Aceh-Andaman 
and 2005 Nias-Simeulue earthquakes, respectively. Pink and green dots denote earthquakes with m, > 
4.5 before (24) and after (43) the 2005 event. The regions of high seismicity correspond to the 
transition between regions of coseismic and aseismic slip. The large question mark east of Nias indicates 
the region where afterslip may have occurred but is not detectable by the existing cGPS network. White 
tick marks on the northern and southern boundaries of the postseismic slip model indicate depths along 


the megathrust. 
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ping fault plane (i.e., the megathrust) embedded 
in a horizontally layered elastic space (23). The 
fault geometry differs slightly from that used 
previously (24); it is extended in length and 
width and has an additional segment at depth. 
The model fault approximately follows the 
curvature of the trench and changes dip from 
10° to 30° at a depth of 27 km. The bottom of 
the model fault occurs at 100 km depth (Fig. 2) 
and follows the Wadati-Benioff zone, as 
defined by relocated seismicity spanning the 
period 1964 to 2005 (25). Constraints on the 
dip angle of the shallower portion come from 
the joint analysis of coseismic geodetic and 
seismic data (26), which yields a preferred dip 
angle for the seismogenic fault of between 8° 
and 12°. This range of dips can account simul- 
taneously for the geodetic displacements, the 
seismic moment, and the amplitude of the 
spheroidal and toroidal normal modes (26). 
We invert both coseismic and postseismic 
slip distributions with the use of the same fault 
geometry and a priori constraints on the rake 
(23). We have explored the impact of minimiz- 
ing model roughness and model length (figs. S1 
and S2). The time-dependent inversion of post- 
seismic slip history adopts the extended network 
inversion filter (ENIF) approach (27, 28). The 
GPS time series and model predictions, ex- 
cluding estimates of benchmark wobble and 
reference-frame errors, are shown in fig. S3 (29). 
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The coseismic slip distribution, constrained 
by both the cGPS and the coral microatoll data 
(24), has two major loci of slip, separated by a 
region of negligible slip near the hypocenter 
(Fig. 2). Thus, the Nias-Simeulue earthquake 
resulted from two distinct ruptures: a M, 8.2 
event to the north of the hypocenter and a M,, 
8.5 event to the south. Within the limits of 
resolution, the peak model slip values are 8 m 
(in the north) and 11 m (in the south). This 
coseismic model is similar to an earlier version 
that was determined using a slightly different 
megathrust geometry (24). Our present ap- 
proach to damping is more rigorous and leads 
to a more spatially compact distribution of slip 
(23). The spatial coverage of data enables 
discrimination of the spatial extent of primary 
slip areas (fig. S1). The lack of up-dip coseis- 
mic slip is a robust feature of our model. 

The postseismic slip distribution results from 
inversion of only the cGPS data. Even without 
microatoll data, resolution tests suggest that we 
can resolve up-dip afterslip well, whereas we 
can resolve down-dip afterslip only north of the 
hypocenter (fig. S2). The spatial distribution 
of postseismic slip remains basically station- 
ary throughout the entire 11-month period 
(figs. S5 and S6). We note that the availability 
of data from three additional sites (PBLI, 
BITI, and BTHL) south of the rupture 
increases the model spatial resolution for later 
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Fig. 3. Coulomb stress change ACFS (color) on the plate interface and amplitude of postseismic slip 
(black contours at 0.25-m intervals). For clarity, the model of coseismic slip (Fig. 2) is spatially 
smoothed before contouring. Purple arrows indicate the rake of the afterslip; green arrows indicate 
the rake of the interseismic velocity modeled/observed at the surface. White and red stars are 
epicenters of 2004 Aceh-Andaman and 2005 Nias-Simeulue earthquakes, respectively. The question 
mark east of Nias indicates a region where our afterslip model is not well constrained because of poor 


data coverage. 
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periods (fig. S6). Given that the pattern of slip 
is constant when we do not use these sites, we 
infer that the pattern we estimate including 
these sites is likely to be representative of the 
earlier period as well (fig. S6). 

Afterslip occurs in three primary regions: up- 
dip, down-dip, and south of the coseismic rupture 
(Fig. 2). The most extensive resolvable region 
of afterslip lies up-dip of the coseismic rupture, 
where the maximum amount of afterslip over 
11 months is ~1.4 m. Our confidence in the 
relative location of coseismic and postseismic 
slip is primarily driven by the observations of 
vertical displacements at two cGPS sites, 
LHWA and BSIM. These sites show coseismic 
uplift of 2.88 m and 1.60 m, respectively, but 
postseismic subsidence of 0.17 m and 0.05 m. 

In the first 11 months after the earthquake, 
cumulative fault slip of ~0.5 m occurs both 
beneath the Batu Islands and down-dip of the 
coseismic rupture. The extent of down-dip slip is 
only loosely constrained because only one cGPS 
site, SAMP, provides data from mainland Suma- 
tra. Indeed, there may have been considerable 
afterslip east of Nias, but we do not have sufficient 
data to resolve it. In many portions of the fault, 
the slip rate after 11 months is still about twice 
the long-term average plate convergence rate. 

Discussion. The good fits to both vertical 
and horizontal components of coseismic and 
postseismic displacements suggest that our 
assumed fault geometry is adequate and that 
both coseismic and postseismic deformation 
resulted from slip on the megathrust. In the first 
11 months of postseismic deformation, we see 
no evidence indicative of viscoelastic relaxation. 
This is probably due to the proximity of the 
geodetic stations to the rupture and to the short 
time span of the observations. Afterslip in the 11 
months following the main shock has a geodetic 
moment of at least 2.5 x 10?! N-m, equivalent 
to an M, 8.2 event or at least 25% of the 
coseismic moment. The estimate of postseismic 
geodetic moment is a minimum estimate, be- 
cause there are likely to be regions of afterslip 
not resolved with the available data. 

Most of the coseismic slip occurred within 
150 km of the trench and within the locked fault 
zone, as inferred from modeling of paleogeo- 
detic and geodetic data (19, 22, 30, 31). To first 
order, and within the inherent limits of our model 
resolution, the region with afterslip surrounds the 
area that slipped during the earthquake. There 
appears to be little overlap between the coseismic 
and aseismic patches. Bearing in mind the 
limitations in spatial resolution, we find that 
95% of the coseismic potency was released at 
depths between 13 and 48 km during the 2005 
event, with a maximum at ~22 km (fig. S7). 

Generally speaking, the inferred rakes of 
both coseismic and postseismic slip are parallel 
to each other, approximately perpendicular to 
the trench, and consistent with previous infer- 
ences of strain partitioning across the forearc 
(32, 33). An exception to this behavior occurs 
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in the south, near the Batu Islands, where the 
coseismic and postseismic models predict slip 
rakes that are parallel to the general direction of 
convergence between the Australian plate and 
the Sunda Block (Fig. 3). Although poorly con- 
strained, such behavior implies along-strike 
variations in strain partitioning. 

The correlation between coseismic Coulomb 
stress change, ACFS, on the plate interface and 
the distribution of afterslip (Fig. 3) (23) sug- 
gests that afterslip is a response of the 
megathrust to the sudden increase of stress 
due to the earthquake rupture. The patch with 
large afterslip up-dip of the coseismic rupture 
clearly coincides with a zone of increased 
ACFS; this correlation also seems to hold for 
the down-dip afterslip zone, except east of Nias 
where the afterslip model is poorly resolved. 

The coseismic slip distribution (Fig. 2) shows 
a distinct saddle between the Simeulue and Nias 
patches. This region of low slip corresponds to 
the location of the hypocenter and to the location 
of a distinct north-south bend in the band of 
aftershocks (Fig. 2). Previous studies have sug- 
gested that a structural tear occurs in this region, 
corresponding to the Batee Fault, and may be 
associated with more complex megathrust ge- 
ometry than that used here (/7, 34). 

To test the extent to which the spatial 
variations in slip are stationary in time, we 
consider historical and paleogeodetic records 
(19, 20). Southeast of Nias, beneath the Batu 
Islands, the region of 2005 afterslip correlates 
with a prominent cluster of medium-sized earth- 
quakes in the past century (20). However, coral 


Fig. 4. Observed and modeled post- A 
seismic displacements (A to C) and the 
relationship of these displacements to 
the cumulative number of aftershocks 


near each of the stations (D to F). Blak ¢ 
solid lines in (A) to (C) are estimated 2-20 
from a one-dimensional spring-slider E 
model in which afterslip obeys a & 
velocity-strengthening friction law [see & -40 
(23) for analytical functions and model & 

a 


parameters] (44). Blue, green, and red 
refer to vertical (U), east (E), and north -60 
(N) displacements, respectively. Note 
that the scale differs between panels. 
Regions used to calculate cumulative 
seismicity are shown in fig. S10. 
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microatolls show that, as in 2005, vertical defor- 
mation during the great historical earthquakes of 
1797, 1833, and 1861 was small (35). Hence, 
we presume that megathrust slip in this region 
is primarily aseismic, with the medium-sized 
earthquakes representing small, spatially limited 
locked regions (22). We have inferred similarly 
low seismogenic coupling beneath the northern 
half of Simeulue in the region separating the 2004 
and 2005 events (24). Seismicity preceding the 
2005 earthquake (25) reveals distinct northeast- 
southwest-trending zones near northeastern 
Simeulue and near the Batu Islands (Fig. 2). 
We suggest that these zones that experience fre- 
quent small earthquakes during the interseismic 
period are likely to be regions dominated by 
aseismic slip. Because of the higher rates of 
seismicity in these regions, it may be more likely 
that a large megathrust event would nucleate there 
and eventually grow into areas that are more 
tightly coupled. Such behavior may be seen for the 
Nias-Simeulue earthquake as well as the 2004 
Aceh-Andaman and 1995 Antofagasta, Chile, 
earthquakes (12, 15). 

The cGPS postseismic displacement histo- 
ries are well fit if we assume that afterslip results 
from rate-strengthening frictional sliding of the 
plate interface in response to the coseismic stress 
change (Fig. 4, A to C) (23). The model is a 
system consisting of a spring and a slider with a 
single degree of freedom (36), where the slider 
obeys an experimental rate-strengthening fric- 
tion law (37, 38): t., = 6,u* + Ao, In(V/V*), 
where t,, is the driving shear stress, o, is the 
normal stress, A is a positive rheological 
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parameter, V is the sliding velocity, and p* 
and V* are reference values. The postseismic 
displacement follows the predicted log[1 + 
(/Tps)] temporal evolution, where the charac- 
teristic time, T),p,, is estimated to be ~3 days, 
and dt,/d In V = Ao, is on the order of 0.2 to 
0.7 MPa. Alternatively, by analyzing the 
evolution of slip as a function of the evolving 
postseismic stresses according to our afterslip 
model (39), we find Ao, to be ~0.2 MPa both 
up-dip and down-dip of the coseismic rupture 
(fig. S8). If we assume that hydrostatic ambient 
pore pressure gives values of effective normal 
stress due to overburden of ~200 MPa and 
~1000 MPa for the up-dip (~ 10 km) and down- 
dip regions (~60 km), respectively, these values 
result in estimates of A of ~5 x 10+, com- 
parable to the value of ~10 x 104 at 35 km 
depth derived from afterslip following the 
2003 M,, 8.0 Tokachi-oki, Japan, earthquake 
(39), or the value of 3 x 10 at 50 km depth for 
the 2001 M,, 8.4 Arequipa, Peru, earthquake 
(40). These values are one to two orders of 
magnitude lower than estimated in laboratory 
studies (4/7, 42). A value of A at the lower end of 
experimental estimates (~50 x 10+) would im- 
ply a low effective normal stress of ~40 MPa. 
Any explanation for such a low value is con- 
jectural; high pore pressure is one possibility. 
About 2100 aftershocks with body wave 
magnitude m, > 3 (43) occurred in the first year 
following the 2005 event. These aftershocks 
amount to only ~7% of the postseismic geodetic 
moment, indicating that afterslip was essentially 
aseismic. Most of these aftershocks form a distinct 
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trench-parallel band between the coseismic slip 
patches and the up-dip zone of afterslip (Fig. 2). A 
less striking band of aftershocks northeast of 
Simeulue corresponds to the down-dip and south- 
ern edges of the Simeulue coseismic slip patch. 
The temporal evolution of aftershocks may 
reflect a delayed response to the coseismic stress 
change of a population of small, volumetrically 
distributed, secondary faults (42). According to 
this model, the cumulative number of after- 
shocks should follow a log(1 + //T,.) evolution, 
mathematically equivalent to that of afterslip. 
However, we may expect different characteristic 
times, 7, and T/,,,, because each process should 
reflect independent responses to the coseismic 
stress change. An alternative mechanism has 
seismicity controlled by the stressing rate as- 
sociated with afterslip (44). In this case, both 
processes should have the same characteristic 
time, as was found for aftershocks of the 1999 
Chi-Chi, Taiwan, and 2001 Arequipa, Peru, 
earthquakes (40, 44). A third potential model 
has aftershocks driven by the coseismic stress 
change but also includes the spatial and temporal 
modulation of stress-associated afterslip (42). 
To test these models, we consider two near- 
field cGPS sites (LHWA and BSIM), which we 
compare to the temporal evolution of nearby 
aftershocks, and one far-field cGPS site (SAMP), 
which we compare to the deep aftershocks. We 
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find a clear linear relationship between cumula- 
tive displacement and cumulative number of 
aftershocks in regions near these cGPS sites (Fig. 
4, D to F). For SAMP, this linear relationship 
exists both after the 2004 M, 9.1 event and 
after the 2005 event. The linear relationship 
between the cumulative number of aftershocks 
and the cumulative displacement at large times 
is not surprising (when presumably ¢ > 7, and 
t >> Tops) because both processes should then 
vary linearly with the logarithm of time. The 
linear correlation in the early postseismic phase, 
when both aftershocks and afterslip depart from 
a linear dependence on the logarithm of time 
(Fig. 5 and fig. S9), is more meaningful. In the 
context of the functional form adopted here, the 
aftershocks and the afterslip have characteristic 
times that differ by no more than a factor of 2, 
although we cannot determine whether the times 
are exactly the same (Fig. 5). We note that a 
related observation for repeating earthquakes in 
the Loma Prieta aftershock zone suggests that 
the recurrence interval follows the characteristic 
inverse time decay, consistent with a model in 
which the repeating events are driven by creep- 
ing in the surrounding medium (45). 

Our present models are limited by the lack of 
detailed structural information, in particular the 
role of splay and strike-slip faults and variations 
in elastic structure. Despite the limitations of 
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Fig. 5. (A) Comparison between the north component of postseismic displacement at LHWA (black 
dots, same as Fig. 4A) and modeled displacements [(23), eq. $1]. The gray line corresponds to the 
best-fit parameter values and is the same as in Fig. 4A, but in linear-log space. The blue and red lines 


correspond to fixing the estimates of 7... 


to be a factor of 2 greater and less than the optimal values, 


respectively. Pink lines denote roughly linear variations of postseismic displacement with the 


logarithm of time when t > T,,. 


[(23), eq. 54]. (B) Same as (A) but for cumulative number of 


aftershocks. We fix 7,, to be the same as 7, and only invert for the amplitude [(23), eq. 56]. (C and 
D) Residuals between model curves and observations with respect to the pink line. The temporal 
evolutions of afterslip and aftershock are bounded by model curves with characteristic time between 
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the existing models, the 2005 Nias-Simeulue 
earthquake clearly illustrates the importance of 
aseismic slip on the shallowest portion of the 
megathrust. A wide variety of interseismic, 
coseismic, and postseismic data from the northern 
Sumatran foreare suggests that frictional behavior 
of the megathrust varies abruptly in space but not 
rapidly in time. Early near-field deformation 
following the 2005 Nias-Simeulue earthquake is 
dominated by afterslip on the megathrust, and 
both afterslip and aftershocks have the same 
functional dependence on time. Although temper- 
ature might be a key factor controlling regional 
upper and lower limits to seismogenic patches and 
down-dip variations of frictional properties, other 
factors must be called upon to explain the long- 
lived along-strike variations of the mode of slip 
seen here and in other subduction zones (/2, 13). 
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Spin-Wave Lifetimes Throughout the 


Brillouin Zone 
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We used a neutron spin-echo method with microelectron-volt resolution to determine the 
lifetimes of spin waves in the prototypical antiferromagnet MnF, over the entire Brillouin zone. 
A theory based on the interaction of spin waves (magnons) with longitudinal spin fluctuations 
provides an excellent, parameter-free description of the data, except at the lowest momenta 

and temperatures. This is surprising, given the prominence of alternative theories based on 
magnon-magnon interactions in the literature. The results and technique open up a new avenue 
for the investigation of fundamental concepts in magnetism. The technique also allows 
measurement of the lifetimes of other elementary excitations, such as lattice vibrations, throughout 


the Brillouin zone. 


he concept of elementary excitations is 

| one of the basic pillars of the theory of 
solids. In the low-temperature, long- 
wavelength limit, such excitations do not in- 
teract and have an infinite lifetime. For nonzero 
temperatures and momenta, the lifetimes of 
elementary excitations are generally limited by 
collisions with other excitations, with important 
consequences for the macroscopic properties 
of solids. For instance, the thermal expansion of 
solids can be understood as a consequence of 
collisions between lattice vibrations (phonons). 
Because of their comparatively simple Hamil- 
tonians, magnetically ordered states are excel- 
lent testing grounds for theories of elementary 
excitations and their interactions. Nevertheless, 
the damping of spin waves in antiferromagnets 
has remained an open problem for four decades. 
Theoretical calculations of magnon lifetimes 
have been carried out since the 1960s, with 
intensive development occurring on several 
fronts in the early 1970s. However, these activ- 
ities ground to a halt by the mid-1970s due to the 
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lack of appropriate experimental data, namely, 
from momentum-resolved measurements with 
sufficient energy resolution. The only low- 


Fig. 1. (A) A diagram A 
of the spectrometer 
TRISP at the FRM-II. G 
denotes the polarizing 
guide and AP the trans- 
mission polarizer; M 
and A are the mono- 
chromator and analyzer, 
as in TAS. S is the 
sample and D the de- 
tector; VS indicates the 
velocity selector. The 
resonance coil pairs (C, 
and C,) are shown in 
red, and the mu-metal 
shielding boxes that en- 
close them in gray. The 
blue ray represents the 
path of the neutrons 
through the spectrome- 
ter, from left to right in 
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temperature data available were taken with mo- 
mentum g = 0, in antiferromagnetic resonance 
(AFMR) and parallel pumping measurements 
(/, 2). Because of the limited energy resolution, 
momentum-resolved data from neutron spec- 
troscopy (3), by contrast, were confined to the 
critical regime extremely close to the Néel 
temperature (7), where most theories do not 
apply. Until recently, no other experimental 
techniques were available that permitted high- 
resolution measurements of excitation lifetimes 
at low temperatures over the whole Brillouin 
zone. We report on a neutron spectroscopy 
method with peV resolution that is used to 
measure spin-wave (magnon) lifetimes in the 
prototypical antiferromagnet MnF, over the 
temperature range 0.04 to 0.6 7... The results 
subject long-standing theoretical predictions to 
a first experimental test and hold promise as a 
probe of elementary excitations in quantum 
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the diagram. (B) The crystal and magnetic structure of MnF,. The gray (smaller) spheres represent Mn2* 
ions and the green (larger) spheres the F_ ions. The arrows indicate the relative directions of the Mn2+ 
spins on the respective sublattices. (C) The magnon dispersion along the q. direction at three selected 
temperatures at and below 40 K. The data were taken on TRISP during the course of the linewidth 
measurements. The curves show the results of fits based on the same spin-wave result used by Okazaki et al. 
(8), in which the anisotropy is expressed by a single-ion form and the interactions of up to third-nearest 


neighbors are taken into account. 
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magnets. The technique is also widely applicable 
to other elementary excitations, such as phonons 
and crystal-field excitations. 

The determination of magnon lifetimes at 
low temperatures requires an energy resolution 
in the preV range, about two orders of mag- 
nitude better than can be achieved by standard 
neutron triple-axis spectroscopy (TAS). We 
have obtained the requisite gain in resolution 
by manipulating the Larmor phase of the neu- 
tron spin with magnetic fields (4). The TRIple- 
axis resonance SPin echo spectrometer (TRISP) 
spectrometer (Fig. 1A) weds the capability of 
TAS of accessing collective excitations through- 
out the Brillouin zone to the extremely high 
energy resolution of neutron spin-echo spectros- 
copy (5). As in typical spin-polarized triple-axis 
spectrometry, the neutrons impinging upon the 
sample are polarized, and the polarization of 
neutrons scattered from the sample is measured. 
On TRISP, this is accomplished through the use 


Fig. 2. Raw polarization 
data taken at (upper curve) 
q = 0.2 r.Lu. and T = 
15 K, (middle curve) g = 
0.15 lu. and T= 15 K, 
and (lower curve) g = 0.2 
rlu. and T = 20 K. The 
lines are exponential fits to 
the data. The corresponding 
Lorentzian magnon line- 
widths (HWHM) are 
12.4 + 0.8 preV, 15.6 + 
0.9 eV, and 18.6 + 1.0 
jeV, respectively. 


Polarization 


Fig. 3. Intrinsic magnon 

linewidth [at temper- 300 
atures ranging from 15 
to 40 K, as a function 
of q. We have plotted 
rd, q) - TG K, Ql, 
where I'(3 K, q) is given 
in the inset (see text). 
The curves show theo- 
retical expressions from 
(14, 15) (see text). Two 
different theoretical ex- 
pressions are valid for 
the small-q case, depend- 
ing on the magnitude of q 
relative to the anisotropy 
energy. Both expressions 0) 
apply only to small gq; 

Stinchcombe and Reinecke 

have applied one of them 
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of a polarizing neutron guide and a transmis- 
sion polarizer, respectively. However, in analo- 
gy to neutron spin-echo spectrometry, the TRISP 
spectrometer also includes regions of effectively 
constant magnetic field that are produced by 
pairs of radio-frequency (RF) resonance coils 
inserted symmetrically (i) between the mono- 
chromator and sample and (1i) between the 
sample and the analyzer (6). The RF frequencies 
in the coils are tuned such that each detected 
neutron that creates an excitation lying on the 
magnon dispersion curve has the same net 
Larmor phase after traversing the two spin-echo 
arms, independent of small variations in the 
wave vector of the excitation. The neutron spin 
polarization determined at the detector is then a 
measure of the linewidth (inverse lifetime) of the 
magnon. In this way, the measured linewidth is 
decoupled (to first order) from the spread in 
energy of the neutrons incident on the sample, 
which contributes to the instrumental resolution 
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to data extending up to q = 0.2 r.Lu. for MnF, near 7, (15). However, except at q = 0, the theory provides 
an excellent fit to the data as a function of g up to g ~ 0.35 r.L.u., both in the magnitude and in the 


q dependence. 
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in TAS. [For a detailed description of the 
technique, see the Materials and Methods (7).] 

We chose the antiferromagnet MnF, for the 
experiment, because its magnetic ground state 
and excitations have been investigated exten- 
sively. The lattice structure and magnetic or- 
dering of MnF, are shown in Fig. 1B. MnF, has 
the body-centered tetragonal structure, with a = 
b = 4.87 A and c = 3.30 A. The Mn?+ ions 
have spin S = *, and the spin in the center of 
the unit cell is oriented antiparallel to those at 
the comers. The strongest magnetic interaction 
is between second-nearest-neighbor Mn?+ spins 
(corner and center spins) and is antiferromagnetic 
(8). A weaker, ferromagnetic interaction exists 
between nearest-neighbor spins (along the c axis). 
A relatively strong uniaxial anisotropy that is 
predominantly the result of dipole-dipole inter- 
actions (9, /0) causes the spins to align along the 
c axis. T,, is 67.6 K. The slope of the magnon 
dispersion is required to set the tilt angles of the 
RF coils and to determine the nonintrinsic con- 
tribution to the data (7). During the experiment, 
the spin-wave dispersion was therefore measured 
at each temperature at which linewidth data were 
taken; a partial data set is shown in Fig. 1C. 

Figure 2 shows raw polarization data as a 
function of the spin-echo time t. The spin-echo 
time is proportional to the frequency in the RF 
coils and the distance between the coils, and it 
also depends on the neutron wavelength. In a 
neutron spin-echo experiment, the dependence 
of the measured polarization on t corresponds 
to the Fourier transform of the scattering func- 
tion as a function of energy. The data in Fig. 2 
are described well by an exponential decay, 
which indicates that the spectral function that 
characterizes the magnon linewidth is a Lo- 
rentzian in energy. The difference in linewidth 
(half-width at half maximum, or HWHM) be- 
tween the upper two and lower two data sets is 
in each case only ~3 eV, but it can be re- 
solved clearly. The upper pair of data sets 
represents a difference in g of 0.05 r.Lu. at 15 K 
(11). For comparison, the HWHM of the cor- 
responding TAS scans of the lower two mag- 
nons in Fig. 2, taken with fixed final neutron 
wave vector k,= 1.7 A“! is ~100 peV. 

The raw data were then corrected for in- 
strumental and nonintrinsic effects (7). Figures 
3 and 4 show the intrinsic magnon linewidth as 
a function of momentum g and temperature 7, 
respectively. The linewidth generally increases 
with increasing g and 7, due to the increasing 
likelihood of collisions with other excitations. 
However, Fig. 3 also shows that the linewidth 
deviates from this general trend and exhibits 
peaks as a function of q close to the center and 
the boundary of the antiferromagnetic Brillouin 
zone. The low-g peak is already present at 3 K, 
the lowest temperature covered by this exper- 
iment, and it evolves weakly with increasing 
temperature. This behavior is not described by 
the dominant magnon relaxation mechanisms 
for which quantitative predictions are available; 
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possible origins will be discussed below. To 
facilitate comparison with these predictions, we 
have treated the 3-K linewidth data as a 
temperature-independent contribution and sub- 
tracted it from the higher-temperature data. The 
results are shown in Figs. 3 (main panel) and 4. 

The intrinsic relaxation channel for mag- 
nons that has received by far the most attention 
in the literature is magnon-magnon scattering. 
In an “n-magnon” scattering event, a magnon 
(here, one excited by an incoming neutron) 
scatters off (n/2 — 1) thermally excited mag- 
nons, producing n/2 scattered magnons that are 
in thermal equilibrium with the sample. In the 
absence of defects and external magnetic fields, 
the lowest-order interaction that limits the 
magnon lifetime is four-magnon scattering. Un- 
fortunately, a comprehensive survey of the 
literature revealed very few theoretical predic- 
tions appropriate for comparison with our data, 
despite the existence of considerable work on 
four- and six-magnon interactions. This is either 
because the calculations used approximations 
valid only in high magnetic fields (for the pur- 
pose of comparison with AFMR data), or be- 
cause strict inequalities that define the range of 
applicability of the theoretical results are ex- 
tremely difficult to satisfy experimentally. An 
analytical expression was given by Harris et al. 
(12), who evaluated the contribution to the line- 
width from four-magnon scattering processes 
analytically for the case of single-ion anisotropy 
with g = 0 (/3). The corresponding result is 
shown in the bottom trace of Fig. 4. At low 
temperatures, the temperature dependence of 
the data is considerably weaker than that pre- 
dicted by this theoretical result for g = 0. The best 
agreement of the magnitude occurs at 40 K, 
where the experimental result is ~30% larger 
than the theoretical one. Predictions for an 
anisotropy gap of dipolar origin, which would 
be more appropriate for MnF,, are not available. 

An additional relaxation channel, in which 
magnons are scattered by thermally excited lon- 
gitudinal spin fluctuations, was considered by 
Stinchcombe and co-workers (/4, /5). The 
curves in Figs. 3 and 4 are based on this 
mechanism. (For g = 0, where the contribution 
of this relaxation is identically zero, we have 
shown the prediction of the four-magnon 
relaxation model, as discussed above.) For the 
larger-q data, the linewidth far from 7, is given 
approximately by 


“T+ pa + ary ( 


where E, is the magnon energy, g* = 2nq/a, 
u* = 2.969 A’, p* = 5.864 A3, and B = 1/k,7, 
with k, the Boltzmann constant (/5). The 
anisotropy parameter o is equal to 0.0184. 
The exchange parameter J(0) = 6.02 meV in- 
cludes both first- and second-nearest-neighbor 


exchange interactions. The parameters R, and 
R,, which are both temperature dependent, can 
each be evaluated using either experimental 
data or results from mean-field theory (/5), 
leading to considerable differences in the 
magnitude of the calculated linewidth and in 
its variation with temperature. In determining 
Ry, we used experimental data for the staggered 
magnetization (/6). Calculation of R, from the 
Brillouin function produces linewidth values that 
agree at the lowest temperatures and begin to 
deviate with increasing temperature: At 40 K, 
the calculated linewidth is 11% smaller. For Ry. 
we used the derivative of the Brillouin function. 
Calculation of R, instead from experimental 
data for the parallel magnetic susceptibility (/7) 
produces linewidth results that are 40% larger 
at 15 K and 30% smaller at 40 K. 

Given the prominence of the magnon- 
magnon scattering channel in the literature, the 
excellent agreement between this model calcu- 
lation and the experimental data is surprising. 
The dominance of the relaxation by longitudinal 
fluctuations is, however, consistent with argu- 
ments by Reinecke and Stinchcombe, who es- 
timated that the contribution to the linewidth 
from four-magnon scattering is only 1/z of the 
magnitude of the above term (/8, /9). Here, z is 
the number of neighbors that experience the 
strongest exchange interaction; z = 8 for MnF,, 
for which case z is the number of next-nearest 
neighbors. Because the analytical expression on 
which the curves in Figs. 3 and 4 are based is 
valid only at low g, deviation from the data at 
larger g is not unexpected. The general expres- 
sion for the linewidth resulting from scattering 
by longitudinal spin fluctuations (/4, 75) should 
be evaluated numerically at high g to see if the 
peak as a function of g can be reproduced 
(20, 21). 

Factors that might be expected to influence 
the linewidth at large g include hybridization of 
the magnon mode with an optical phonon and 
the possible proximity of a two-particle contin- 
uum. The two lowest observed optical phonons 
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along (00/) lie at least 6 meV higher than the 
magnon mode over the entire Brillouin zone 
(22), so the former mechanism can be ex- 
cluded. The data of Schweika et al. suggest that 
the single-magnon spectrum does not cross the 
two-particle continuum (23). 

An explanation of the peak centered at g ~ 
0.1 r.lu. (Fig. 3, inset) requires a different mech- 
anism (24). An additional potential source of 
linewidth is the hyperfine interaction, which 
gives rise to the scattering of electronic magnons 
from nuclear spin fluctuations (25). The contri- 
bution from the hyperfine interaction would 
only be weakly temperature dependent, because 
the nuclear spin system is already highly dis- 
ordered thermally at 3 K. Four-magnon scat- 
tering terms in which one electronic and one 
nuclear magnon interact have indeed been 
shown to generate maxima in the linewidth at 
nonzero q, but estimates of the amplitude of 
this contribution are at least an order of mag- 
nitude smaller than the observed effect (25). 
The crossing of magnon and transverse acoustic 
phonon modes at g = 0.04 r.l.u. may also 
contribute to the peak (22, 26, 27). An 
additional relaxation mechanism that must be 
considered as a possible source of linewidth is 
that of magnon-phonon scattering. Experimental 
estimates of the linewidth due to magnon- 
phonon relaxation in MnF, in zero field range 
over three orders of magnitude, but again they 
appear too small to explain the observed peak 
(28-30). A theoretical estimate of the spin-lattice 
relaxation time (which should be of the same 
order of magnitude as the magnon-phonon 
relaxation times) corresponds to a linewidth of 
~0.5 peV at 25 K in MnF, (3/, 32). In this 
theory, the magnon-phonon interaction arises 
from the phonon modulation of the exchange 
interaction and is dominated by two-magnon— 
one-phonon processes. The result varies as T>, 
which corresponds to a linewidth of 1 eV at 
30 K and 5 peV at 40 K. The maximum po- 
tential contribution to our data would then be 
~60% of the linewidth at g = 0 and 40 K. 


Fig. 4. Intrinsic mag- 
non linewidth with 0 < 
gq < 0.35 r.lu. and g = 
0.5 r.Lu., shown as a 
function of tempera- 
ture. As in Fig. 3, data 
taken at 3 K have been 
subtracted. For 0.1 < 
q< 0.35 rlu., the curves 
show theoretical results 
from (15). For q = 0, 
results from Harris et al. 
are plotted (12, 13). 
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Other mechanisms that may contribute to the 
presence of this peak include two- and three- 
magnon non—momentum-conserving processes 
that originate from scattering from defects 
(25, 33, 34). The linewidth originating from 
the latter process is peaked at intermediate g. 
Using parameters derived from comparison 
with data on RbMnF,, one can estimate its 
contribution in MnF, to be two orders of mag- 
nitude smaller than the data (34). 

The challenge to theory posed by the 
temperature- and momentum-dependent peaks 
in the magnon linewidth in MnF, should 
stimulate new activity in the field of spin-wave 
decay mechanisms. High-resolution lifetime mea- 
surements over the full Brillouin zone in a rel- 
atively simple antiferromagnet such as MnF, 
permit detailed evaluation of proposed processes, 
which should provide a basis for addressing such 
interactions in more complex magnetic systems. 
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Surface and Deep Ocean 
Interactions During the Cold 
Climate Event 8200 Years Ago 


Christopher R. W. Ellison,? Mark R. Chapman,?* lan R. Hall?* 


Evidence from a North Atlantic deep-sea sediment core reveals that the largest climatic 
perturbation in our present interglacial, the 8200-year event, is marked by two distinct cooling 
events in the subpolar North Atlantic at 8490 and 8290 years ago. An associated reduction in deep 
flow speed provides evidence of a significant change to a major downwelling limb of the Atlantic 
meridional overturning circulation. The existence of a distinct surface freshening signal during 
these events strongly suggests that the sequenced surface and deep ocean changes were forced by 
pulsed meltwater outbursts from a multistep final drainage of the proglacial lakes associated with 


the decaying Laurentide Ice Sheet margin. 


hanges in the mode of operation of the 
Atlantic meridional overturning circu- 
lation (MOC) are thought to be an im- 
portant driver of rapid, large-scale climate 
change (/, 2) and are widely believed to be 
linked to changes in freshwater forcing (3). 


The climate of the present interglacial, which 
began about 10,000 years ago, is remarkably 
stable when compared to the last glacial 
period (4) with the exception of a single, 
large climatic excursion ~8200 years before 
the present (yr B.P.) (5), commonly referred 


to as the 8.2 ky event. It has been suggested 
that the short-lived cooling episode captured 
in the Greenland ice core records is one 
element of a broader climate signal that has 
an interhemispheric signature (6), but the 
magnitude and the sequence of changes 
surrounding this climate event are not always 
straightforward to decipher (7). The decay of 
the Laurentide Ice Sheet and the catastrophic 
final drainage of the large proglacial Lakes 
Agassiz and Ojibway at ~8470 yr B.P. (8) 
preceded the 8.2 ky event, so the precise 
relation between the lake outburst, the oce- 
anic response, and the cooling observed in the 
wider circum-Atlantic region remains un- 
resolved (6). However, the similarity in the 
timing of these two events, coupled with 
model-derived suggestions regarding the sen- 
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sitivity of the MOC to freshwater inputs (9), 
have led to a widely held view that a 
meltwater-induced alteration to the ocean 
circulation may explain the trends observed 
in paleoclimate records. Despite recent efforts 
to detect MOC changes in paleoceanographic 
archives (J0, 11), clear and conclusive evi- 
dence has yet to be reported (6, 7). 

We analyzed deep-sea sediment core MD99- 
2251 recovered from the southern limb of the 
Gardar Drift in the subpolar North Atlantic 
(57°26.87' N, 27°54.47° W; 2620-m water 
depth), where the interaction of Iceland Scotland 
Overflow Water (ISOW) with the underlying 
sea-floor topography results in sediment focusing 
and exceptionally enhanced sediment accumula- 
tion rates. Calibrated accelerator mass spectrom- 
etry 4C dates (Fig. 1) (/2) indicate a mean 
sediment accumulation rate of ~110 cm ky7! 
through the studied time interval of 9200 to 7200 
years ago. Our sampling resolution over this 
interval is <20 years, with each sample rep- 
resenting an integrated signal of 6 to 10 years. 
Consequently, it has been possible to produce 
regional climate proxy records of unprecedented 
detail that reveal unambiguous information about 
the nature of subcentennial scale events and the 
relative phasing of surface and deep ocean 
changes. Because our climate proxy recon- 
structions are coregistered signals from within a 
single sediment core, the observed sequence of 
events and temporal offsets are robust features 
independent of correlation problems (7). 

Variations in the relative abundance of the 
polar foraminifer Neogloboquadrina pachy- 
derma sinistral (s.) coiling are routinely used 
to determine past positional changes of cold 
surface waters (4, 13). The marked decline in 
N. pachyderma s. abundances from >60% to 
<2% at ~1600 cm in core MD99-2251 
defines the northward retreat of polar waters 
at the onset of the Holocene (Fig. 1). Low 
abundances of the polar foraminifer (mean of 
1.4%) persist throughout the Holocene with 
the exception of a marked excursion centered 
at ~1220 cm. This episode marks the cold 
climatic extreme in the Holocene, with 
faunal-derived transfer function estimates of 
summer sea surface temperature (SST) ~2°C 
colder than at present. The N. pachyderma s. 
percent abundance maximum is dated to 8290 
yr B.P. in our record, in good agreement with 
the age of the pronounced cold excursion 
identified in the 6!8O-derived atmospheric 
temperature record from the Greenland Ice 
Sheet Project 2 (GISP2) ice core (5, /4). 
Although age estimates for the cold extreme 
differ by ~100 years in the oceanic and ice 
proxy records, this difference is not significant 
compared with the combined dating uncertain- 
ties (12, 15), and we consider the surface ocean 
and atmospheric cooling events to be synchro- 
nous. This interpretation is supported by the 
structural similarity across the temperature ex- 
cursion, i.e., the rates of change associated with 


the cooling into and warming after the event 
together with the near-identical duration for the 
event of ~70 years in both archives (Fig. 2). 

Both of our surface ocean proxies, % N. 
pachyderma s. and the 6!8O composition of 
the planktonic foraminifer Globigerina bul- 
loides, reveal the existence of a separate, 
earlier climate event centered at 8490 yr B.P. 
(Fig. 2). This indicates that the prominent 8.2 
ky cooling was not an isolated event but may 
be the culmination of a change initiated ~200 
years earlier. These two marked cooling 
episodes occur within an interval of reduced 
SSTs that existed from ~8900 to 8000 yr 
B.P. (Fig. 3). This multicentennial cool phase, 
which is broadly consistent with the recent 
identification of a 400- to 600-year climate 
anomaly preceding the 8.2 ky event (7), 
signifies a longer-term change in the pattern 
of ocean heat transport and may reflect the 
cumulative impact of outflow from melting ice 
sheets (J6). The dust supply to Greenland, 
inferred from variations in potassium content in 
the GISP2 ice core (/7), suggests a similar 
pattern of polar atmospheric reorganization 
over this interval. 

The abrupt initial cooling at 8490 yr B.P., 
which lasted ~80 years according to the % N. 
pachyderma s. data, was coincident with a ~0.6 
per mil (%o) depletion in planktonic foraminif- 
eral 5'8O values (Fig. 2). The magnitude of this 
5'80 excursion increases further once tempera- 
ture and ice volume effects are incorporated 
(Fig. 3). The resulting shift in 8'8O... ater COM- 
position can only readily be explained by a 


substantial freshening of the surface ocean. This 
perturbation corresponds to a salinity reduction 
of | to 1.3, assuming a freshwater input with an 
end-member 5!8O composition of between 
—35%o and —20%o (18). The changes in surface 
ocean proxy records 8490 years ago in the 
subpolar North Atlantic have no large-scale 
counterpart within the GISP2 temperature data 
but are hinted at in other paleoclimate proxy 
records (/7, 79), and, significantly, the timing of 
these coregistered signals is indistinguishable 
from the timing of the catastrophic meltwater 
release from Lakes Agassiz and Ojibway at 
~8470 yr B.P. (8). This discharge event, whose 
combined volume has been estimated at 163,000 
km [or 5.2 x 10° m? s~! (Sy) if released in one 
year (16)], is the only identified potential fresh- 
water source of sufficient magnitude to explain 
the presence of the pronounced 8180... ater 
signal at the distal, open ocean location of core 
MD99-2251. 

The more prominent cooling episode, 
associated with a further SST decrease of 
~1.2°C centered at 8290 yr B.P. (Fig. 3), 
occurs ~200 years after the initial lake 
outburst and surface ocean cooling event. This 
event, correlative with the 8.2 ky event 
observed in Greenland ice, has a wider oceanic 
imprint and is registered elsewhere both in the 
North Atlantic (4, 20) and Norwegian Sea 
(13, 21). Notably, the initiation of this SST 
cooling phase is synchronous with a second and 
less pronounced freshening of the surface ocean. 
This signal may be related to a later, smaller 
freshwater outburst from the western Agassiz 
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Fig. 1. The % abundance (>150 jm) of the polar foraminifer WV. pachyderma s. coiling through the 
Holocene section of core MD99-2251 (note reversed axis). V. pachyderma s. % traces the rapid retreat 
of polar waters, marking the onset of the Holocene warming, at ~1600 cm and captures a significant, 
brief readvance of polar waters at ~1220 cm. The core chronology has its basis in 23 accelerator mass 
spectrometry (AMS) 14C dates that have been converted to calendrical ages by using the CALIB program 
(31). Position and calibrated ages of AMS 14C dates are indicated along depth axis (black triangles) 
(12). The shaded area highlights the study interval shown in Figs. 2 and 3 (i.e., 9200 to 7200 years 
ago). Mean sediment accumulation rate through this interval is ~110 cm ky~?. 
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region that postdates the main Lake Agassiz and 
Ojibway discharge (J6). The possibility of 
complex variations in freshwater forcing 
through changes in drainage pathways, dis- 
charge rate, reservoir refilling, and multipulse 
flooding has been highlighted previously (22). 
It is interesting to note that the two-step draw- 
down model (/6), whereby the Lakes Agassiz 
and Ojibway meltwaters discharge in separate 


Fig. 2. Surface ocean 
climate proxy records in 
core MD99-2251 com- q 
pared to the GISP2 ice 
core data over the in- 
terval 9200 to 7200 
years ago. (A) NV. pachy- 
derma s. % abundance 
data and (B) the GISP2 
temperature (57°O,_.) 
record (14). Dating errors 
around the 8.2 ky event 
are shown for each time 
series. The lower and 
upper 20 limits on the 
calibrated AMS *4C date 
for the cooling event in 
core MD99-2251 (8290 
yr B.P. at 1220-cm depth) 
are 8180 and 8340 yr 33 
B.P., respectively. The 
GISP2 time scale has 
an estimated error of 
up to 2% on the basis -34 -| 
of annual layer counting M 

(15); this equates to a iN 
maximum dating uncer- 
tainty of +164 years for 
8200 years ago. (C) 
GISP2 5180... record 
(red) overlain on the NV. 
pachyderma s. % abun- 
dance data (blue) in “36 
MD99-2251, with the 
age scale of the GISP2 4 
5180... offset by 100 “lp 
years (i.e., showing the 4 
interval 9100 to 7100 
ice core years). The appli- 
cation of a constant age 
offset of +100 years, 
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113,100 km? (3.6 Sv) and 49,100 km? (1.6 Sv) 
pulses (for a period of 1 year), appears to be 
consistent with the multievent sequence found 
in our paleoceanographic records. 

The impacts of these changes to the near- 
surface environment on the flow regime of 
deep ocean currents are determined by an- 
alyzing the grain size distributions of the 
terrigenous fine fraction (23). Variations in 
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mond marker and error bar indicates the timing of Lakes Agassiz and Ojibway outburst at ~8470 yr 
B.P. (8) and is associated with the first major shift in MD99-2251 proxy data (indicated by older 
shaded bar). Younger shaded bar highlights the largest cooling in the MD99-2251 proxy data, 


correlated with the 8.2 ky event. 
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the mean size of the sortable silt (SS) indicate 
that the near-bottom flow speed of ISOW, an 
important component of the Atlantic MOC, 
changed significantly during the 8.2 ky event 
(Fig. 3). The cooling and freshening of the 
surface ocean at 8490 yr B.P. coincides with 
the first signs of a longer-term reduction in 
the flow speed of ISOW. We document the 
most pronounced reduction in ISOW current 
vigor at the conclusion of the second surface 
ocean perturbation around 8290 yr B.P., with 
a rapid drawdown in ISOW flow speed taking 
place over a few decades. The minimum flow 
speeds attained at 8260 yr B.P. then remain 
subdued for a further 100 to 200 years. The 
stepped warming trends after the 8.2 ky event 
evident in the % N. pachyderma s. and GISP2 
5'80 records appear to be in phase and lead 
the recovery in the deep flow speeds by ~200 
years, demonstrating a nonlinearity in the 
response of the MOC to such perturbations. 
The longer-term response of the deep ocean 
circulation to the 8.2 ky event indicated by the 
SS data is of similar duration to that estimated 
in modeling experiments (24). Both SST and 
salinity remained at intermediate values during 
the SS minimum, most likely sustained by a 
combination of the enhanced baseline melt- 
water discharge that persisted for centuries after 
the initial outburst (16) and the MOC adjust- 
ment time required to account for the altered 
surface ocean buoyancy forcing. SST and 
paleocurrent speed proxies are in phase by the 
time of the later climate transition at ~7900 yr 
B.P., with elevated flow speeds suggesting a 
somewhat enhanced MOC activity that may be 
analogous to the “climate overshoot” oceanic 
response associated with abrupt interstadial 
warming during the last glacial period (/8). 
We can interpret the decline in SS values 
across the 8.2 ky event as resulting from re- 
duced deep water convection in the Nordic and 
Arctic seas in response to freshwater perturba- 
tions and the subsequent propagation of this 
signal to the deep ocean through ISOW flow 
rates. However, the vigor of ISOW recorded in 
MD99-2251 also may be modified by the de- 
gree of entrainment of surface and intermediate 
water masses in the Iceland Basin during its 
descent to depth after its overflow of the 
Iceland-Scotland Ridge. Such changes are 
controlled by the density contrast between the 
overflowing waters and the upper water column 
(25, 26). Hence, the ISOW flow signal we ob- 
served may reflect a “shoaling” of intermediate 
waters. If the boundary between intermediate 
and deep waters rose in a similar fashion to the 
situation at the last glacial maximum (27), then 
the ISOW flow strength recorded at the MD99- 
2251 core site could be reduced. The Holocene 
perturbation in deep flow speed is ~50% of the 
relative magnitude of the circulation decrease 
that marked the end of the last interglacial 
period (~118 kyr B.P.) on the Gardar Drift 
(28), although such comparisons are only 
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meaningful if the prevailing MOC configura- 
tions were similar for both events. Whatever the 
origin of the ISOW flow signal, it is clear that 
we observe a significant change in the initial 
stages of the MOC. These findings lend sup- 
port to model experiments that predict a re- 
duction in MOC intensity during the 8.2-ky 
event (6, 9, 24). 

Although two minima in solar output in- 
ferred from cosmogenic !4C production records 
(29) may correlate with the key meltwater 
events and SST minima that we identify at 
8490 and 8290 yr B.P., there is no clear relation 
between oceanic proxy records and solar var- 
iability through the remainder of the study in- 
terval. Hence, any causal linkage would appear 


to be dependent on other factors, such as the 
preconditioning of the North Atlantic by 
enhanced meltwater input. Rather, our coregis- 
tered paleoclimate records most likely demon- 
strate a cause and effect relationship: The 8.2 
ky event was forced by the freshwater anomaly 
associated with the pulsed drainage of Lakes 
Agassiz and Ojibway, which caused extensive 
cooling and freshening in the North Atlantic 
Ocean, and this in turn can be directly linked to 
alterations in the MOC. Although the duration 
of the meltwater release is likely to be on the 
order of years (/6) and the surface ocean 
perturbations seen in the subpolar North Atlan- 
tic are on the order of decades, the associated 
change in the deep ocean flow lasted ~400 


Fig. 3. Surface and 
deep ocean linkages over 
the interval from 9200 
to 7200 years ago. (A) 
N. pachyderma s. % 
abundance data, (B) 
summer SST, (C) surface 
ocean stable isotope 
(5780, cawater) composi- 
tion, and (D) SS mean 
grain size. SST values (er- 
ror + 1°C) were estimated 
by using the modern ana- 
log technique and are 4 
derived from full plank- 
tonic foraminiferal as- 
semblage counts (~400 
individuals) using the 
>150-um size fraction 
(12). The depletion in 
5780. water Values at 
8490 and 8330 yr B.P. 
correspond to reduc- 
tions in surface ocean 
salinity of 1 to 1.3 and 
0.5 to 0.7, respective- 
ly, assuming a fresh- 
water end-member 5180 
composition of between 
—35%o and —20%o (18). 
The SS is a paleocurrent 
flow speed proxy (double 
arrow), where a_ higher 
mean reflects stronger 
flow of the depositing 
current and vice versa | 


N. pachyderma s.(%) 


5" O.cawater (%0) 


(23). Shaded bars are as 
in Fig. 2 and indicate the 
two pulses of cooling i ] 
and freshening of sur- 
face ocean conditions 
that appear to correlate 
with a larger (3.6 Sv) 
meltwater outburst at 
~7700 *C yr B.P. and 
a second, smaller melt- 
water discharge (1.6 Sv) 
dated at ~7600 “C yr 
B.P. (16). 
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years. Such information on the sensitivity and 
response of the climate system to a disturbance 
of the MOC during interglacial times is crucial 
and relevant to our possible future, because 
many global climate models suggest that an in- 
terruption of the MOC is a likely outcome of 
future climate change (30). 
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Phase Modifiers Promote Efficient 
Production of Hydroxymethylfurfural 
from Fructose 


Yuriy Roman-Leshkov, Juben N. Chheda, James A. Dumesic* 


Furan derivatives obtained from renewable biomass resources have the potential to serve as 
substitutes for the petroleum-based building blocks that are currently used in the production of 
plastics and fine chemicals. We developed a process for the selective dehydration of fructose to 
5-hydroxymethylfurfural (HMF) that operates at high fructose concentrations (10 to 50 weight %), 
achieves high yields (80% HMF selectivity at 90% fructose conversion), and delivers HMF in a 
separation-friendly solvent. In a two-phase reactor system, fructose is dehydrated in the aqueous 
phase with the use of an acid catalyst (hydrochloric acid or an acidic ion-exchange resin) with 
dimethylsulfoxide and/or poly(1-vinyl-2-pyrrolidinone) added to suppress undesired side reactions. 
The HMF product is continuously extracted into an organic phase (methylisobutylketone) modified 
with 2-butanol to enhance partitioning from the reactive aqueous solution. 


e are entering an era of diminishing 
\ \ availability of petrochemical re- 
sources used to produce the energy 


and chemical materials needed by society. 
Abundant biomass resources are a promising 
alternative for the sustainable supply of valuable 
intermediates (such as alcohols, aldehydes, 
ketones, and carboxylic acids) to the chemical 
industry for production of drugs and polymeric 
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materials. In this context, the high content of 
oxygenated functional groups in carbohydrates— 
the dominant compounds in biomass—is an 
advantage, in contrast to the drawbacks of such 
functionality for the conversion of carbohy- 
drates to fuels. However, efficient processes 
must be developed for the selective removal of 
excess functional groups and the modification 
of others to create the desired products. 
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Here, we present a strategy for the selective 
dehydration of fructose to produce HMF, 
thereby providing a cost-effective route for the 
synthesis of this valuable chemical intermediate. 
Indeed, HMF and its ensuing 2,5-disubstituted 
furan derivatives can replace key petroleum- 
based building blocks (/). For example, HMF 
can be converted to 2,5-furandicarboxylic acid 
(FDCA) by selective oxidation, and Werpy and 
Petersen (2) and Pentz (3) have suggested that 
FDCA can be used as a replacement for ter- 
ephthalic acid in the production of polyesters 
such as polyethyleneterephthalate (2) and poly- 
butyleneterephthalate. They have also sug- 
gested that the reduction of HMF can lead to 
products such as 2,5-dihydroxymethylfuran 
and 2,5-bis(hydroxymethyl)tetrahydrofuran, 
which can serve as alcohol components in the 
production of polyesters, thereby leading to 
completely biomass-derived polymers when 
combined with FDCA. In addition, HMF can 
serve as a precursor in the synthesis of liquid 
alkanes to be used, for example, in diesel fuel (4). 

Unfortunately, as noted by various authors 
(5-8), the industrial use of HMF as a chemical 
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Fig. 1. Schematic representation of reaction pathways for acid-catalyzed dehydration of fructose. Structures in brackets correspond to representative 
species. Detailed chemistry is presented in (20, 23). 
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intermediate is currently impeded by high pro- 
duction costs. Early work showed that HMF could 
be produced in high yields using high-boiling 
organic solvents [such as dimethylsulfoxide 
(DMSO), dimethylformamide, and mixtures of 
polyethyleneglycol with water] over various 


Table 1. Results for acid-catalyzed dehydration of fructose. Runs 1 to 27 were 
carried out at 453 K for 2.5 to 3 min with 0.25 M HCl aqueous phase solutions; 
runs 28 to 39 were carried out at 363 K for 8 to 16 hours with an acidic ion- 
exchange resin at a 1:1 w/w fructose:resin ratio. Aqueous phase and organic 
phase compositions are reported as w/w ratios. Conversion is defined as the ratio 


catalysts, including sulfuric acid and sulfonic 
acid resins. However, this approach necessi- 
tates difficult and energy-intensive isolation 
procedures (6, 9—/3). In pure water, fructose 
dehydration is generally nonselective, leading to 
many by-products besides HMF (/4). Recent 


org 


advances have shown improved results in pure 
water or in water-miscible solvent systems (such 
as acetonitrile or acetone), but only with the use 
of low initial fructose concentrations, which 
inevitably generate low HMF concentrations 
(/, 10, 15, 16). Biphasic systems, in which a 


of fructose consumed to fructose added initially. R = [HMF],,./[HMFl... 
Standard runs for HCl, H,SO,, and HPO, catalysts used 1.5 g of aqueous phase 
and 1.5 g of extracting solvent. Runs marked with an asterisk used 3 g of 
extracting solvent. Runs for resin catalyst used 5.0 g of aqueous phase and 5.0 g of 
extracting solvent. V IV, Was measured upon completion of reaction. 


Run Ayieous phase vomnesiion Organic phase Conversion Selectivity [HMF].,, [HMF],,. R Vorg! 
no. composition (%) (%) (g/ml) (g/ml) Vag 
30 wt % fructose with HCI catalyst 
1 Water none 50 51 0.060 - 0.00 0.00 
2 Water MIBK 91 60 0.056 0.050 0.90 1.51 
3 Water MIBK 75 73 0.035 0.033 0.96 3.13 
4 Water 7:3 MIBK:2-butanol 68 70 0.033 0.054 1.65 1.56 
5* Water 7:3 MIBK:2-butanol 86 80 0.026 0.045 1.73 3.68 
6 8:2 Water:DMSO MIBK 94 67 0.077 0.050 0.66 1.41 
7 8:2 Water:DMSO 7:3 MIBK:2-butanol 80 75 0.050 0.064 1.30 1.49 
8* 8:2 Water:DMSO 7:3 MIBK:2-butanol 87 82 0.034 0.046 1.39 3.65 
9 7:3 Water:PVP MIBK 74 66 0.055 0.041 0.81 1.56 
10 7:3 Water:PVP 7:3 MIBK:2-butanol 62 76 0.042 0.047 1.25 1.57 
11* 7:3 Water:PVP 7:3 MIBK:2-butanol 79 82 0.030 0.041 1.44 3.83 
12 7:3(8:2 Water:DMSO):PVP MIBK 79 75 0.071 0.047 0.71 1.52 
13 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 82 83 0.063 0.065 1.12 1.62 
14* 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 89 85 0.043 0.046 1.17 3.99 
50 wt % fructose with HCI catalyst 
15 Water none 51 28 0.064 - 0.00 0.00 
16 Water MIBK 65 47 0.049 0.051 1.11 1.80 
17 Water 7:3 MIBK:2-butanol 71 59 0.049 0.079 1.73 1.91 
18* Water 7:3 MIBK:2-butanol 88 72 0.045 0.069 1.55 4.66 
19 8:2 Water:DMSO MIBK 71 57 0.076 0.060 0.86 1.69 
20 8:2 Water:DMSO 7:3 MIBK:2-butanol 80 63 0.077 0.085 1.19 1.87 
21* 8:2 Water:DMSO 7:3 MIBK:2-butanol 91 74 0.059 0.072 1.30 4.87 
22 7:3 Water:PVP MIBK 85 56 0.074 0.060 0.80 1.72 
23 7:3 Water:PVP 7:3 MIBK:2-butanol 77 61 0.076 0.081 1.19 1.85 
24* 7:3 Water:PVP 7:3 MIBK:2-butanol 90 77 0.062 0.070 1.22 5.15 
25 7:3(8:2 Water:DMSO):PVP MIBK 77 61 0.095 0.066 0.77 1.85 
26 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 72 62 0.068 0.074 1.25 1.89 
27* 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 92 77 0.076 0.070 1.03 5.11 
10 wt % fructose with ion-exchange resin catalyst 
28 Water MIBK 75 44 0.010 0.011 1.02 1.32 
29 Water MIBK 17 43 0.0021 0.0024 1.15 1.29 
30 Water 7:3 MIBK:2-butanol 61 60 0.0086 0.014 1.61 1.31 
31 8:2 Water:DMSO MIBK 84 47 0.015 0.012 0.79 1.26 
32 8:2 Water:DMSO MIBK 19 80 0.0052 0.0045 0.87 1.24 
33 8:2 Water:DMSO 7:3 MIBK:2-butanol 74 68 0.015 0.017 1.18 1.24 
34 7:3 Water:PVP MIBK 74 63 0.018 0.013 0.79 1.43 
35 7:3 Water:PVP 7:3 MIBK:2-butanol 70 65 0.015 0.015 1.04 1.46 
36 7:3(8:2 Water:DMSO):PVP MIBK 80 1 0.026 0.013 0.54 1.38 
37 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 76 77 0.020 0.019 1.03 1.43 
30 wt % fructose with ion-exchange resin catalyst 
38 7:3(8:2 Water:DMSO):PVP MIBK 89 60 0.066 0.041 0.66 1.65 
39 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 83 65 0.053 0.051 1.07 1.74 
30 wt % fructose with H,SO,, catalyst 
40* Water 7:3 MIBK:2-butanol 80 66 0.022 0.035 1.63 3.54 
41* 8:2 Water:DMSO 7:3 MIBK:2-butanol 85 71 0.029 0.040 1.35 3.59 
30 wt % fructose with HPO, catalyst 
42* Water 7:3 MIBK:2-butanol 65 65 0.016 0.029 1.89 3.47 
43* 8:2 Water:DMSO 7:3 MIBK:2-butanol 51 76 0.016 0.025 1.58 2.95 
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water-immiscible organic solvent is added to 
extract continuously the HMF from the aque- 
ous phase, have also been investigated with the 
use of mineral acid or zeolite catalysts at 
temperatures greater than 450 K (6, 17-21). 
However, poor HMF partitioning into the 
organic streams used in these studies neces- 
sitated large amounts of solvent, thereby requir- 
ing large energy expenditures to purify the 
diluted HMF product (22). 

Fructose is dehydrated in the presence of 
an acid catalyst to produce HMF and various 
by-products (Fig. 1). Although evidence exists 
supporting both the open-chain and the 
cyclic fructofuransyl intermediate pathways 
(20, 23), it is clear that the reaction inter- 
mediates and the HMF product degrade by 
means of processes such as fragmentation, 
condensation, rehydration, reversion, and/or 
additional dehydration reactions. We used a 
modified biphasic system to address key HMF 
production limitations. Specifically, we im- 
proved the HMF selectivity (defined as the 
moles of HMF produced divided by the moles 
of fructose reacted) of the acid-catalyzed 
dehydration of concentrated [30 to 50 weight 
% (wt %)] fructose feeds by adding modifiers to 
both phases. The reactive aqueous phase 
containing the catalyst and the sugar was 
modified with polar aprotic molecules [DMSO 
or 1-methyl-2-pyrrolidinone (NMP)] and a hy- 
drophilic polymer [poly(1-vinyl-2-pyrrolidinone) 
(PVP)]. The water-immiscible organic phase 
[methylisobutylketone (MIBK)] used during 
the reaction to extract HMF was modified with 
2-butanol. The ratio of relative volumes of the 
organic and aqueous phases in the reactor 
(Vee Vig) as well as the ratio of the HMF 
concentration in the organic layer to that in 
the aqueous layer (defined as the extraction 
ratio, R) proved to be important variables in 
the process. Upon completion of the dehydra- 
tion reaction, both phases can be separated for 
efficient product isolation. Although various 
acid catalysts can be used to perform the 


Table 2. Simulation of HMF yield (Y) and energetic yield (Yn) for 
selected dehydration systems. [HMF],, corresponds to the HMF concen- 
tration in the aqueous phase leaving the extractor, and [HMF],,, 


dehydration reaction, HCl showed the highest 
HMF selectivity of the common mineral acid 
catalysts (Table 1, runs 5, 8, and 40 to 43). 

We performed experiments using 0.25 M 
HCI at 453 K under autonomous pressure. To 
optimize selectivity, we varied the concen- 
trations of each aqueous phase modifier 
(Table 1 and Fig. 2) (24). Together, DMSO 
and PVP increased the selectivity from 60 to 
75% (Fig. 2A). However, we also sought to 
optimize partitioning of the HMF product into 
the organic phase both to minimize degradation 
reactions arising from extended HMF residence 
in the reactive aqueous phase and to achieve 
more efficient recovery of HMF in the subse- 
quent isolation step. Unfortunately, the aqueous 
additives are a liability in this respect, because 
they increase the solubility of HMF in the 
aqueous phase (thereby decreasing the R value). 
The addition of 2-butanol to the organic phase 
helped counteract this effect by increasing 
HMF solubility in the organic phase relative 
to pure MIBK (Fig. 2B). Although pure 2-butanol 
is moderately soluble in water [up to 19.5 wt % 
at 298 K, as measured by high-performance 
liquid chromatography (HPLC)], it partitioned 
effectively into the MIBK layer when mixed 
with MIBK at a 3:7 weight/weight (w/w) ratio 
(<5 wt % at 298 K was detected in the aqueous 
phase by HPLC). Starting with a 30 wt % 
aqueous fructose solution, our optimal results 
using all three modifiers (DMSO, PVP, and 
2-butanol) yielded 0.065 g/ml of HMF in the 
organic layer, with 83% HMF selectivity at 
82% conversion (Table 1, run 13). 

The selectivity for production of HMF de- 
creases when the initial sugar concentration is 
increased from 30 to 50 wt % (Fig. 2C). This 
result is in agreement with literature reports 
that increasing the fructose concentration 
leads to higher rates of condensation re- 
actions (6, /3). The addition of DMSO, PVP, 
and/or 2-butanol to the 50 wt % fructose 
systems in the same quantities used for lower 
fructose concentrations did not substantially 
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improve the HMF selectivity. However, 
doubling the amount of the 7:3 MIBK:2- 
butanol extracting solvent increased the 
selectivity substantially. The optimal sys- 
tem using all three modifiers generated 
0.070 g/ml of HMF in the organic phase at 
77% HMF selectivity and 92% conversion 
(Table 1, run 27). 

A heterogeneous catalyst is more easily 
separated from the product and recycled 
than a homogeneous catalyst, leading us to 
investigate fructose dehydration with the 
use of an acidic ion-exchange resin. These 
experiments were conducted at 363 K under 
autonomous pressure (Table 1, runs 28 to 
39). When we used the resin catalyst, we 
observed trends in modifier impact that 
were similar to those we saw with HCl; 
however, the absolute selectivities were 
lower for the resin system. Kuster (6) re- 
ported that elevated temperatures (>453 K) 
favor higher HMF selectivity because at 
these conditions the rate of HMF formation 
is 4 to 10 times as fast as the rate of HMF 
degradation. Because ion-exchange resins 
degrade at these higher temperatures, it is 
desirable to replace HCl with a heteroge- 
neous acid catalyst that is stable to 450 K. 
In this respect, and considering the favor- 
able results we obtained using H,PO, (Table 
1, run 43), we tested a niobium phosphate 
catalyst at 453 K, obtaining promising 
results of 73% HMF selectivity at 62% con- 
version. In addition, Dias ef al. (25) have 
shown that nanoporous materials possessing 
sulfonic acid groups can promote dehydra- 
tion of xylose to furfural at elevated temper- 
atures. Carlini et al. (26) observed fructose 
dehydration at low temperatures (353 K) 
and mostly at low conversion (<50%) with 
the use of vanadyl-phosphate—based cata- 
lysts, which may also withstand elevated 
temperatures. 

Experiments conducted at low tempera- 
ture and low conversions (Table 1, runs 29 


Runs are based on runs in Table 1. The selectivity is set to the value 
obtained experimentally, and conversion is assumed to be 90%. The yield 
is calculated based on the HMF present in the organic stream sent to the 


corresponds to the HMF concentration entering the evaporator in Fig. 3. evaporator. 
Run Aqueous phase composition Organic phase Selectivity [HMF].,, [HMF],.. Y (%) Yn (%) 
no. composition (%) (g/ml) (g/ml) 
30 wt % fructose 
2 Water MIBK 60 0.0075 0.045 48 34 
4 Water 7:3 MIBK:2-butanol 70 0.0001 0.057 61 43 
6 8:2 Water:DMSO MIBK 67 0.025 0.048 48 35 
7 8:2 Water:DMSO 7:3 MIBK:2-butanol 75 0.0009 0.063 66 48 
12 7:3(8:2 Water:DMSO):PVP MIBK 75 0.024 0.057 56 44 
13 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 83 0.0032 0.071 73 56 
50 wt % fructose 
16 Water MIBK 47 0.0026 0.054 39 27 
26 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 62 0.0019 0.091 53 43 
27 7:3(8:2 Water:DMSO):PVP 7:3 MIBK:2-butanol 77 0.0055 0.071 67 51 
www.sciencemag.org SCIENCE VOL 312 30 JUNE 2006 1935 
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and 32) offer mechanistic insight about the 
role of DMSO in enhancing HMF selectiv- 
ity. The results show that DMSO primarily 
increases the rate of fructose conversion 
into HMF and to some extent decreases the 
rates of undesirable parallel reactions. 
Likewise, earlier work has suggested that 
DMSO suppresses both the formation of 
condensation by-products and HMF re- 
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Fig. 2. Effects of changing the aqueous phase 
composition from water (W), to 8:2 w/w water: 
DMSO (W:D), to 7:3 w/w water:PVP (W:P), to 7:3 
w/w (8:2 water:DMSO):PVP (W:D:P). (A) HMF 
selectivity with 30 wt % fructose feed. White and 
gray bars represent MIBK and MIBK:2-butanol 
extracting solvents, respectively. (B) Extraction 
ratio, R. White and gray bars represent MIBK 
and MIBK:2-butanol extracting solvents, respec- 
tively. (C) HMF selectivity with MIBK:2-butanol 
extracting solvent. White and gray bars repre- 
sent 30 and 50 wt % fructose, respectively, and 
hatched bars show the improvement obtained 
with the use of double the amount of extracting 
solvent. 


hydration by lowering the overall water 
concentration (6, /3). Similar studies have 
indicated that DMSO both favorably shifts 
the equilibrium of the rate-determining step 
in HMF production and inhibits acyclic 
reaction sequences that may lead to un- 
desirable intermediates (23). 

The reaction of 10 wt % fructose in 7:3 
water:NMP with MIBK as the extracting sol- 
vent and an acidic ion-exchange resin catalyst 
generated 68% HMF selectivity at 80% 
conversion, indicating that the addition of 
NMP to water enhances the selectivity of 
HMF production from fructose. Because 
NMP is an aprotic solvent with properties 
similar to DMSO, NMP appears to act 
through mechanisms similar to those of 
DMSO to enhance HMF selectivity in the 
fructose dehydration reaction. However, 
whereas the carryover of DMSO from the 
aqueous phase into the organic phase is not 
considerable (<0.8 wt % DMSO in MIBK 
after contacting an 8:2 water: DMSO aqueous 
solution, as measured by HPLC), the carry- 
over of NMP into the organic phase is high 
(~5 wt % NMP in MIBK after contacting a 
7:3, water:NMP aqueous solution, as mea- 
sured by HPLC), thereby complicating the 
subsequent separation of HMF from the 
organic phase by evaporation. Notably, we 
found that replacing NMP with PVP, a stable 
hydrophilic polymer that has NMP moieties 
along the polyethylene chain, preserves the 
benefits on selectivity produced by NMP but 
eliminates organic phase contamination due to 
the low solubility of PVP in the extracting 
solvent. In an analogous manner, grafting 
DMSO onto a hydrophilic polymeric backbone 
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could be used as a strategy to eliminate trace 
amounts of DMSO in the organic phase. 

The addition of 2-butanol to MIBK en- 
hances HMF selectivity by removing HMF 
more efficiently from the reactive aqueous 
medium. Notably, increasing the extraction 
ratio R and/or increasing V,,,/V,, can coun- 
teract the faster rate of HMF degradation in 
the presence of fructose. This undesirable 
reaction between fructose and HMF is 
reflected in lower HMF selectivities at 50 
wt % fructose when compared with 30 wt % 
(Table 1), and we observed directly that 
lower selectivities are obtained when con- 
trolled amounts of HMF are added initially to 
the fructose reaction system. In addition, 
separating HMF from the aqueous medium 
lowers the rate of HMF rehydration into 
levulinic and formic acids. Analyses by gas 
chromatography—mass spectrometry (GC-MS) 
of the aqueous and organic phases after con- 
version of 30 wt % fructose showed that the 
general composition of the by-products cor- 
responds to typically 10% rehydration, 5% 
dehydration, 5% fragmentation, and 80% 
condensation compounds. 

We performed simulations for selected 
experiments from Table 1 to estimate the 
HMF concentrations that would be obtained 
by combining our batch reactor experiments 
with a countercurrent extractor to remove the 
HMF remaining in the aqueous layer (Fig. 3). 
The final amount of HMF obtained by 
combining the organic streams from the 
reactor and the extractor (i.e., the stream 
entering the evaporator) is used to calculate 
the energetic yield (Yn) as a measure of the 
overall efficiency of our process for obtaining 


Organic phase 


Epp iaalicts Aqueous phase 


Fig. 3. Batch process for production of HMF from fructose with simulated countercurrent extraction 
and evaporation steps. The aqueous phase (white) contains fructose, DMSO, PVP, and the acid 
catalyst and is represented in the bottom half of the batch reactor. The organic phase (gray) 
contains MIBK or MIBK:2-butanol and is represented in the top half of the batch reactor. 
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HMF by solvent evaporation. The energetic 
yield is the product of the HMF yield (Y), 
defined as the moles of HMF in the stream 
entering the evaporator in Fig. 3 divided by 
the total moles of fructose fed to the batch 
reactor, and an energy efficiency (n), defined 
as the heat of combustion of the HMF 
product (AH. y;4) minus the energy neces- 
sary to evaporate the solvent (AH. 


‘ abate! 
normalized by the energy content of the 


product [ie., n = (AA ume - ATs eg)! 
AH. mel. These simulations used the exper- 


imental selectivity for each system (Table 1), 
which was assumed to remain constant at 
90% conversion, the experimental value of 
Veal V sg for the batch reactor, and the 
experimental value of R to model a counter- 
current extractor operating with equal vol- 
umes of aqueous and organic streams. 
Aqueous and organic phase modifiers im- 
prove the value of Yn, thus reducing energy 
expenditures required to obtain the HMF 
product when compared with the water/ 
MIBK system (Table 2). 

The value of Yn alone does not address the 
difficulties of using high-boiling organic sys- 
tems. For example, although a theoretical 
value of Yn > 75% can be obtained with pure 
DMSO, the HMF product cannot be separated 
from the solvent by simple evaporation. Pre- 
vious work has shown that because of the reactive 
nature of concentrated HMF at high temper- 
atures, distillation of HMF from DMSO leads to 
substantial carbonization of the product (/0). 
Low-temperature separation processes such as 
vacuum evaporation and vacuum distillation 
have been used to separate various solvents 
and by-products from HMF mixtures, but no 
experimental data have been reported for 
DMSO (27-29). Accordingly, we used Aspen 
Plus Simulation Software (Version 12.1, Aspen 


Technology Inc.) to compare energy require- 
ments for the separations of HMF from a 
low-boiling solvent (pure MIBK) and from a 
high-boiling solvent (pure DMSO) for vacuum 
evaporation and vacuum distillation processes 
(for HMF levels of 10 wt %). Vacuum evap- 
oration simulations predicted that 99.5% of the 
MIBK solvent can be evaporated at 13 mbar 
and 343 K with a 2.5% loss of HMF, whereas 
evaporating DMSO at 1.3 mbar and the same 
temperature resulted in a 30% loss of HMF. 
Consequently, HMF separation from DMSO 
with minimal losses requires the more expen- 
sive vacuum distillation process (e.g., 0.66 
mbar and a bottom temperature of 386 K). 
When comparing both solvents with the use of 
vacuum distillation, simulations predicted that 
an efficient separation of HMF from pure 
DMSO requires 40% more energy as com- 
pared with pure MIBK, clearly showing the 
advantages of using a low-boiling solvent 
system. 
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An Octahedral Coordination 


Complex of Iron(VI) 


John F. Berry,* Eckhard Bill,? Eberhard Bothe,? Serena DeBeer George,” Bernd Mienert,* 


Frank Neese,*+ Karl Wieghardt*+ 


The hexavalent state, considered to be the highest oxidation level accessible for iron, has 
previously been found only in the tetrahedral ferrate dianion, FeO,,2-. We report the photochemical 
synthesis of another Fe(VI) compound, an octahedrally coordinated dication bearing a terminal 
nitrido ligand. Méssbauer and x-ray absorption spectra, supported by density functional theory, are 
consistent with the octahedral structure having an Fe=N triple bond of 1.57 angstroms and a 
singlet a ground electronic configuration. The compound is stable at 77 kelvin and yields a high- 


spin Fe(III) species upon warming. 


in the natural world, generally occurs in com- 
pounds of its divalent or trivalent ions. Iron 
ions in these two oxidation states occur in a wide 
range of minerals (/, 2) found terrestrially as 


T= the most abundant transition element 


well as on Mars (3, 4) and are widely used in 
biological electron transfer processes, such as 
those involving iron-sulfur clusters (5). Low- 
valent, electron-rich compounds play key roles 
in biological hydrogen production, for example, 


Fe(I) and Fe(0) (6-8). Nature also uses more 
electron-deficient iron centers for highly specific 
and efficient enzymatic oxidation (9/1), which 
has spurred chemists to prepare analogous 
coordination complexes for use in synthesis 
and oxidative waste remediation. Notable among 
these are Fe(IV) complexes that activate H,O, 
(12) and Fe(IV)=O species that hydroxylate 
hydrocarbons (13, /4), as well as several other 
Fe(IV) (15-19) and Fe(V) (20, 27) compounds. 

Although iron has eight electrons in its 
valence shell, only the +6 state is considered 
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accessible and only a single Fe(VI) species 
is known. This ferrate ion, [FeY'O,]?, is an 
especially powerful oxidant used in organic 
synthesis (22), soil and wastewater treatment 
(23, 24), batteries (25), and disinfectants (26). 
Its reduction product, rust, is innocuous—an 
advantage over many other oxidants—but 
ferrate is unstable and often indiscriminately 
reactive. The search for additional hexavalent 
iron species that may react in a more specific 
and controlled way has been long and fruit- 
less. Here, we report the low-temperature 
synthesis and characterization of a second 
Fe(VI) compound. 

One successful approach to the synthesis of 
high-valent iron species has been photolysis of 
lower-valent azido complexes to yield either a 
photoreduced or photo-oxidized product: 


N3 
“im hv 


Pee +15N, (1) 


I 


3 N 

“hah. yy ye “om ] (n+2}+ 

iS res + N (2) 
Homolytic Fe—N bond cleavage in the reductive 
pathway (Eq. 1) returns one electron to the 
metal center and produces an azide radical that 
quickly decomposes to dinitrogen. In the oxi- 
dative pathway (Eq. 2), dinitrogen is formed via 
heterolytic N-N bond cleavage, leaving a 
terminal nitrido group and formally oxidizing 
the iron center by two electrons. This synthetic 
approach was first applied to iron-porphyrin 
species (27) and later to nonheme iron com- 
plexes (20, 21). In our work, the Fe(IID-azido 
complex [(cy-ac)FeN,]PF,, 1, where cy-ac is 
the anion 1,4,8,11-tetraazacyclotetradecane-1- 
acetate (Scheme 1), was photolyzed at 77 K 
with 420-nm light to produce the Fe(V)-nitrido 
species [(cy-ac)FeN]PF,, 2 (2/), which has 
been shown to have a nearly orbitally degener- 
ate doublet ground state due to the Fe-N bond 
configuration o?14n*! (bond order 2.5) (28). 

Recently, we reported the Fe(III)-azido com- 
plex 3 in which the N-H groups in the cy-ac 
ligand of 1 have been replaced by N—Me (methyl) 
groups (29). This change has important steric and 
electronic ramifications for the iron center, one of 
which is that 1 has a low-spin electronic ground 
state whereas 3 is high-spin (30, 37). Another 
major difference between 1 and 3 is that 3 may 
be reversibly oxidized to the stable Fe(IV)-azido 
complex [(Me,cy-ac)FeN,](PF,),, 4, whereas 
electrochemical oxidation of 1 does not lead to 
a stable Fe(IV) product (29). 

Although photolysis of 1 leads efficiently 
to the Fe(V) complex 2, photolysis of 3 under 
identical conditions yields only a high-spin 
Fe(II) product, identified by Méssbauer spec- 
troscopy. The isomer shift (5 = 1.04 mm s~!) 
and quadrupole splitting (AE, = 2.59 mm s~') 
of the photoreduced complex suggest that 


the product is the octahedral species 
[(Me,cy-ac)Fe(NCCH,)]*. No conditions have 
yet been found to induce photo-oxidation of 3. 

Compound 4 is also photochemically reac- 
tive. Upon laser irradiation at 77 K with 650-nm 
light, frozen solutions of 4 change color from 
cherry-red to pale yellow. A frozen Méssbauer 
sample of 4 (generated electrochemically in 
acetonitrile solution containing 0.1 M NBu,PF,, 
where Bu = butyl) was irradiated for 5 hours 
under these conditions, and the Méssbauer 
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Scheme 1. 


spectrum (Fig. 1A) is well simulated by two 
subspectra, neither of which corresponds to 4 
(5 = 0.11 mm s7!, AE, = 1.92 mm s'). 
Subspectrum b, which accounts for 27% of the 
iron in the sample, is broad, with 5 = 0.53 mm s~! 
and AE,, = 1.13 mm s“!, consistent with a 
high-spin Fe(III) species that may form via 
photoreduction. The major species present (sub- 
spectrum a) has a very low 6 value of-0.29 mm s7! 
and AE, of 1.53 mm s™!. This 6 value is 
0.40 mm s~! lower than that of 4 and 0.19 


Fig. 1. Méssbauer spec- 
tra of a photolyzed aceto- 
nitrile solution of 4 
measured at 80 K (A) 
and at 4.2 K in an ap- 
plied magnetic field of 
7 T (B). Subspectrum a 
corresponds to 5 and was 
the only subspectrum 
that was fitted in (B). 


Table 1. Comparison of experimental and calculated properties of 5. Errors are SDs, v(C=O) is the C=O 
stretching frequency, and 7 represents the asymmetry parameter of the electric field gradient tensor. 


Fe=N (A) Fe-N,, (A) 


Fe-O (A) 


v(C=0) (cm~2) 5 (mm s~?) AE (mm s~?); y 


Experiment 1.57 + 0.02 2.03 + 0.02* 2.03 + 0.02* 


Calculation 1.532 2.124 1.911 


1706 -0.29 +1.53; 0.33 
1740 -0.31 +0.75; 0.52 


*Only an average Fe-N/O distance could be refined from the EXAFS data. The values 2.06 A for the Fe-N bonds and 1.92 A for 
the Fe—O bond are in reasonable agreement with this average value. 
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mm s_! lower than that of 2, which suggests 
that species “a” has a higher formal oxidation 
state than either 4 or 2. We therefore assign this 
species as [(Me,cy-ac)FeN](PF,),, 5, an Fe(VI)- 
nitrido complex. 

The 6 value of 5 is considerably higher than 
that of the ferrate ion in various salts (—0.79 to 
—0.85 mm s~! at 80 K) (32), which is the only 
other Fe(VI) species known. Note that there are 
major differences between the geometric and 
electronic structures of these two species that 
prohibit the direct use of their 6 values to infer 
electron deficiency. The most readily apparent 
difference is in the coordination number of the 
iron. The influence of the coordination number 
of the iron on the isomer shift is well 
demonstrated by comparing salts of the 
[FeCl]? trianion with those of the [FeCl,}- 
anion; the isomer shifts in the former cases are 
typically larger by 0.23 mm s~! (33), which 
reflects the difference in overall covalency due 
to different metal-ligand bond lengths. Also 
important is the fact that the ferrate dianion is 
paramagnetic with a spin triplet ground state 
(S = 1), but 5 is diamagnetic, as evidenced by 
its Méssbauer spectrum measured in an 
applied magnetic field of 7 T (Fig. 1B). 


Although the spin state often has a striking 
effect on the Méssbauer isomer shift in Fe(II) 
and Fe(III) species because of the population 
of o antibonding orbitals in the high-spin 
states, typically this causes the high-spin 
species to have higher 5 values due to longer 
metal-ligand bond lengths and lower co- 
valency. In the case of d? Fe(VI) ions, we 
expect this effect to be diminished because the 
6 antibonding orbitals will not be populated in 
the high-spin state. Another major difference 
between ferrate and 5 is that there are four 
strong and covalent Fe=O double bonds in the 
ferrate ion, whereas 5 has only one strong 
iron-nitrogen multiple bond (see below), and 
this effect is likely the dominating influence in 
the variation of the isomer shifts of these 
species. 

It is useful to compare the isomer shift 
of 5 with those of other low-spin octahedral 
complexes with similar supporting ligands 
(Fig. 2) and electronic configurations ranging 
from d° to d?. Because the compounds within 
this series use similar supporting ligands and 
their valence electrons exclusively occupy the 
t,,-based orbitals, a nearly linear correlation 
between the isomer shift and oxidation state is 
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observed, which further supports the assign- 
ment of 5 as a genuine Fe(VI) complex. 

Electrochemical oxidation of 3 to 4 and 
subsequent photolysis to 5 was also monitored 
by infrared spectroscopy using an optically 
transparent thin layer electrode (OTTLE) cell. 
The spectral changes from 3 to 4 were as de- 
scribed in (29). Upon irradiation with laser light 
at 650 nm, a single new species is observed on 
the basis of a C=O stretching band at 1706 cm7! 
that grows concurrently with the diminishing 
intensity of the N, stretching band of 4 (34). 
We have determined a linear correlation be- 
tween the C=O stretching frequencies of 
Me,cy-ac complexes and the Fe-O bond 
distance (35), and the C=O stretch in 5 cor- 
relates well with an Fe—-O bond distance of 
~1.9 A (fig. S1), which is fairly long for a 
bond to a highly oxidized metal center but is 
consistent with a bond trans to a short and 
strong iron-nitrogen multiple bond. 

To further confirm our assignment of 5 as 
an Fe(VI)-nitrido complex, we analyzed the 
x-ray absorption spectrum. The edge and pre- 
edge region are shown to the left in Fig. 3, 
where they are compared to the corresponding 
data for the Fe(V) complex 2 (28). There is an 
intense pre-edge peak in each case, consistent 
with the presence of a strongly covalent iron- 


Fig. 2. Plot of the isomer shifts 0.8 7116 ligand multiple bond, and the area of this peak 
oe . pxiesson state - the on = 66 plas = for 5 (39 + 2 units) is significantly higher than 
owing low-spin iron comp! ee wi = ae Y — 2 for 2 (27 + 2 units) (28). The peak area is also 
Fela ar Ras if BAL = 04|*_~ rs 7114 &% higher than those observed for Fe(IV)-oxo 
se baal dae ai oes ts vale ns r oe 3 species (36, 37), reflecting a greater cova- 
(21), trans-l{cyc am)Fe(N3),1* (20), & 0.2 Nast 7113 3 lency of the Fe=N bond in 5. The energy of 
[(cy-ac)FeX]+ with X = N,, Cl, OTf 5 ” mi : : 
. B s this peak in 5 (7115.1 eV), as well as of the 
(CF,SO,), and F (21, 30), low-spin w o i oS 0 ; : : : 
ae: Oo s 7112 3 edge itself (7125.0 eV), is ~1 eV higher in 
[(Me,cy-ac)FeN,]* (29), [trans-  & A io) he 
2 3 ‘ !, energy than it is for 2 (7114.2 eV for the pre- 
(cyclam)FeN.],N (20), [(cy-ac)Fe],N S -0.2 ie ~~ 7111 3 d d 7124.1 eV fi h d i 
(21), [(Me, cyclam)FeO]2+ (43), = 1. [3 e ige an has e or the e Ige), consistent 
[(cy-ac)FeO]}+ (22), [(TMCS)FeO]+ -0.4 7110 with an oxidation state one unit higher in the 


(TMCS = 1-mercaptoethyl-4,8,11- 1 
trimethyl-1,4,8,11-tetraazacyclo- 
tetradecane) (36), [(Me,cy-ac) 


2 3 4 § 6 
Oxidation Number 


FeN,]?+ (29), [trans-(cyclam)FeN(N,)]* (20), [(cy-ac)FeN]* (21), and 5. A plot of the Fe x-ray 
absorption spectroscopy (XAS) pre-edge peak energies of Fe(III), Fe(IV), Fe(V), and Fe(VI) complexes 
versus Fe oxidation state is also shown (white squares). The dashed and dotted lines represent least- 
squares fits [the Fe(IV) complex [(cy-ac)FeO]OTf is excluded in the former]. 


former. The pre-edge energies of 1, 2, 5, and 
of the Fe(IV)-oxo complexes reported by Que 
(37) [averaged and recalibrated as noted in 
(28)] show a nearly linear relationship with 
oxidation state (Fig. 2). 

The extended x-ray absorption fine struc- 
ture (EXAFS) region of 5 was also analyzed 


Fig. 3. (Left) Normalized Fe K-edge XAS 1.2 = 
spectra of 2 [dotted line, data taken | od 
from (28)] and 5 (solid line) with an 4, 4° 
expansion of the pre-edge region (inset). § $o15| j 
(Right) Non—phase-shift-corrected €& Bie i 
Fourier transform of the EXAFS data of S 0.8 E i 2 
5 (inset). The solid lines are the data; 4 ee ij 2 
dotted lines represent fits of the data. 3 06 oe ie ie dj & 
N Energy (eV) é = 
= b 
E 0.4 Fj 
fe) 4 
z Py 
0.2 me 
Z Y, 
0 l n 
7105 7115 7125 
Energy (eV) 
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Fig. 4. (Left) Structure of the dication of 5 from DFT calculations with hydrogen atoms omitted. Color 
code: brown, iron; gray, carbon; blue, nitrogen; red, oxygen. (Right) Calculated 3d orbital manifold. 


(Fig. 3, right, and table S1) and is best fit with a 
model in which there is one short Fe-N bond 
length of 1.57 A and an average of five Fe-N or 
Fe—-O bond distances of 2.03 A [which could not 
be deconvoluted into separate Fe-N and Fe-O 
distances (38)]. The spectrum is consistent with 
the overall geometry predicted by a density 
functional theory (DFT) calculation on the 
dication [(Me,cy-ac)FeN]** (Fig. 4, left, and 
Table 1). 

The calculated Fe=N distance of 1.53 A 
agrees well with the EXAFS result, and the 
Fe—O distance of 1.90 A, which is elongated as 
a result of the trans influence of the short iron- 
nitrogen triple bond, agrees well with the 
infrared data and the absence of acetate-to-iron 
charge transfer bands in the visible spectrum of 
5. These electronic absorptions are observed in 
the visible spectrum of 4, which appears to 
have a shorter Fe-O distance. The calculated 
Fe-N distances are elongated by ~0.06 to 
0.09 A relative to the EXAFS data (38), but 
this discrepancy is a systematic error in the 
calculation that has been observed in the cal- 
culated geometries of several other Me,cy-ac 
complexes (30). 

Vibrational and Méssbauer properties were 
also calculated for the dication [(Me,cy-ac)FeN]** 
(Table 1) and agree well with the values found 
experimentally. The calculated C=O stretch is 
calculated to be ~30 cm7! higher in energy 
than the observed value, which is a systematic 
overestimation for Me,cy-ac complexes (fig. 
S1). Although an experimental value is lacking, 
the computed Fe=N stretching frequency of 
1064 cm™! in 5 is similar to that observed in 
the isoelectronic complex [(cy-ac)MnYN]PF,, 
1011 cm™! (39), and also in the Fe(IV) com- 
plex [PhB(CH,P(‘Pr),),]FeN (where Ph = 
phenyl and ‘Pr = isopropyl), which shows an 
Fe=N triple bond stretch at 1034 cm~! (J6). 
The isomer shift of 5 is almost perfectly 
reproduced in the calculation, which supports 
our assignment of the oxidation state of 5 and 
also lends confidence to the calculated elec- 


tronic structure detailed below. The sign of 
AE, is accurately calculated, and the magni- 
tudes of the experimental and calculated values 
are consistent with a d? ion with a doubly 
occupied nonbonding d,,, orbital and a strongly 
covalent iron-nitrogen multiple bond. 

We were particularly interested in analyzing 
the nature of the Fe=N bond in the context of 
the evolving understanding of metal-ligand 
multiple bonds (40, 4/). On the basis of our 
DFT electronic structure calculations (Fig. 4, 
right), the highest occupied molecular orbital 
(HOMO) is best described as a doubly occupied 
d,.,, orbital. The xz and yz orbitals are highly 
destabilized in energy because of considerable 
contributions (45%) from p orbitals of the 
terminal nitrido ligand and are best considered 
to be Fe-N n* antibonding orbitals. Occupancy 
of these orbitals reduces the Fe—N bond order, 
as in the case of 2, which exhibits an Fe-N 
bond order of 2.5 (28). The calculated bond 
order [from Léwdin analysis (42)] in 5 is 2.82, 
justifying the first-order description of 5 as an 
Fe(VI)-nitrido complex with an Fe=N triple 
bond. This view is fully consistent with the 
decrease in the Fe=N bond distances from 
1.61 A in 2 (28) to 1.57 A in 5, which is 
comparable to the Mn=N triple bond (1.54 A) 
observed in [(cy-ac)MnN]+ (39) and is 0.08 A 
longer than the calculated Fe=N triple bond 
distance in the tetrahedral Fe(IV)-nitrido com- 
plex [PhB(CH,P(‘Pr),),]FeN (16) as expected 
from the difference in coordination number. For 
comparison, a typical Fe=O double bond length 
in Fe(IV)-oxo species is 1.65 A (37, 43, 44). 

Compound 5 is stable for months in frozen 
solution at 77 K and is stable for some time in 
fluid solution at 40°C. A Méssbauer spectrum 
of a solution thawed to -40°C for 10 min 
showed only 29% diminished intensity of the 
signal due to 5, which remained the major 
species. The decomposition product evident in 
the spectrum could be assigned as a high-spin 
Fe(III) species because of its Méssbauer pa- 
rameters. Thus, 5 readily gains three electrons. 


The discovery of 5 has implications for 
biological systems for which high-valent iron 
intermediates have often been proposed. Al- 
though Fe(IV)-oxo species have been observed 
in some cases [(45-47); see also (48)], concrete 
evidence of an Fe(V) intermediate is lacking. 
Postulation of Fe(V) intermediates is often met 
with skepticism because very few genuine Fe(V) 
coordination complexes are known. The results 
reported here should support further consideration 
of proposed high-valent iron intermediates. 
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Molecular Recognition in the 
Selective Oxygenation of Saturated 
C-H Bonds by a Dimanganese Catalyst 


Siddhartha Das, Christopher D. Incarvito, Robert H. Crabtree,* Gary W. Brudvig* 


Although enzymes often incorporate molecular recognition elements to orient substrates 
selectively, such strategies are rarely achieved by synthetic catalysts. We combined molecular 
recognition through hydrogen bonding with C-H activation to obtain high-turnover catalytic 
regioselective functionalization of sp? C-H bonds remote from the —-COOH recognition group. The 
catalyst contains a Mn(j1-O),Mn reactive center and a ligand based on Kemp's triacid that directs a 
—COOH group to anchor the carboxylic acid group of the substrate and thus modify the usual 
selectivity for oxidation. Control experiments supported the role of hydrogen bonding in orienting 


the substrate to achieve high selectivity. 


elective functionalization of C-H bonds 

at sites in molecules remote from more 

reactive substituents is a major chal- 
lenge in synthetic chemistry (7). In biological 
systems, monooxygenases and fatty acid 
desaturases have long been known to com- 
bine molecular recognition with reactive 
catalytic metal centers and thereby oxidize 
hydrocarbons with very high regio- and 
stereoselectivity (2-9). Multiple amino acid 
residues interact with the substrate via non- 
covalent interactions, such as hydrogen bond- 
ing and n-stacking, to position a specific 
reaction site in a precisely favorable ori- 
entation relative to a (typically iron-based) 
catalytic center. The ubiquitous hydroxylating 
cytochrome P450—based enzymes, for exam- 
ple, having an iron-porphyrin in the active site, 
constitute the most widely distributed class of 
regioselective monooxygenases (2, 3). Nonco- 
valent recognition is important to ensure ade- 
quate lability for catalytic turnover. This 
strategy is hard to implement in synthetic 
systems that, unlike proteins, must rely on 
much smaller, simpler scaffolding structures 
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to influence substrate orientation. In one case, 
Breslow and co-workers grafted porphyrin- 
based manganese catalysts to cyclodextrin 
groups, which anchored substrates in a favor- 
able orientation through hydrophobic inter- 
actions (/0—/3). Regioselective oxidation has 
also been achieved with other catalysts, but in 
this case substrates were aligned through 
covalent bonds, which prevented turnover 
(14-16). 

Here we report a nonporphyrin di-j1-oxo 
dimanganese compound that catalyzes the 
highly regioselective oxygenation of saturated 
C-H bonds in ibuprofen and (4-methylcyclohexyl) 


H(4) 


a 
C(33) 


(3) 
C(31) 
C(25) C(26) 


C(32) 


cl27) 
Fig. 1. ORTEP diagram of ligand 2. 


acetic acid (cis + trans) with >100 turnovers. 
The selectivity arises from noncovalent mo- 
lecular recognition via reversible H-bonding 
between a carboxylic acid group of the 
catalyst and a carboxylic acid group of the 
substrate. 

The complex [H,O(L)Mn(j1-O).MivL)OH,|(NO,), 
(la, L is 2,2’:6'2’-terpyridine) has earlier been 
reported as a very active catalyst for oxidation 
chemistry with Oxone (peroxomonosulfate) as 
primary oxidant (17, 18). Good evidence was 
obtained in this system to exclude the inter- 
mediacy of freely diffusing radicals, which 
would otherwise degrade the selectivity (/9). 
Our present study uses the same di--oxo 
dimanganese core, but with a different ligand, 
2 (Fig. 1). Our ligand design addressed several 
key criteria: the need to stabilize high-valent 
manganese, an oxidation-resistant frame- 
work, and a -COOH group properly directed 
for molecular recognition. In ligand 2, the 
terpy group was chosen to stabilize high- 
valent manganese (20, 21). The phenylene 
linker provides a spacer between the docking 
element and the remote site of functionaliza- 
tion. The Kemp’s triacid fragment provides a 
U-turn motif, a —COOH group suitably 
oriented for the molecular recognition function. 
The structure of the ligand was confirmed by 
x-ray crystallography (Fig. 1) (22). A suitable 
catalyst precursor, [(2)MnCl,], was prepared 
by refluxing a solution of ligand 2 in aceto- 
nitrile and a saturated aqueous solution of 


C(7) C16) 
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excess MnCl,. Subsequent oxidation of 
[(2)MnCL] with 0.80 eq of Oxone in CH,CN- 
H,O mixture (1:1) gave [H,O(2)Mn(u- 
O),Mn(2)OH,](NO,), (1b) (22, 23). 

Molecular modeling allowed us to pre- 
dict which C-H bond in the substrate would 
be expected to come closest to the active 
site. The geometry of the proposed H-bonded 
catalyst-substrate complex was obtained by 
importing the crystal structure parameters of 
the ligand 2 (Fig. 1) and the di-u-oxo diman- 
ganese core into the model (/7, 18, 23), docking 
ibuprofen, and then minimizing the energy 
(Fig. 2) (MM2, CAChe 5). We chose ibuprofen 
[2-(4-isobutyl-phenyl)-propionic acid] (3) as an 
appropriately rigid substrate with two sites of 
attack (indicated by bold arrows in Fig. 3). 
Oxidation occurs at the remote benzylic carbon 
to give 4 and at the vicinal position to give 5 
(Fig. 3). If the catalysis operates via the 
catalyst-substrate complex predicted by the 
model (Fig. 2), then 4 should be the major 
product, with any initial -CH(OH) interme- 
diate being rapidly oxidized further to the ke- 
tone (Fig. 3). With another rigid substrate, 
(4-methylcyclohexyl) acetic acid (6¢: cis iso- 
mer; 6t: trans isomer; 6: 6c + 6t) docked via 
hydrogen bonding to 1b, the model (fig. S17) 
suggests that the remote tertiary C-H bond at 
C6 (indicated by the solid arrows in Fig. 3) 
should undergo oxidation preferentially to give 
the corresponding alcohol 7 (22). 

Catalytic runs were carried out with 1 eq of 
substrate, 0.001 eq of 1b, and 5 eq of Tetra- 
butylammonium Oxone in acetonitrile. At room 
temperature, the reaction was quenched after 
2 hours by addition of excess NaHSO,; 
products and unreacted substrate were ex- 
tracted into ether. To confirm the role of dock- 
ing, we performed control experiments using 
[H,O(L’)Mn(u-O),Mn(L’)OH,](C1O,), (le, L’ 
is 4’-phenyl-2,2':6',2”-terpyridine) (23), a cat- 
alyst lacking the key -COOH group; under 
identical conditions, selectivity was lost. We 
also confirmed that no Oxone-induced reac- 
tivity occurs in the absence of the manganese 
catalyst (22). 

When ibuprofen was treated with cata- 
lyst 1e, which lacks the key -COOH group, 
the ratio of remote to proximal oxidation 
products (4:5) was roughly 3:1. With cata- 
lyst 1b, however, the recognition function- 
ality raised the selectivity for 4 more than 
10-fold (Table 1). In time-dependent studies, 
the ratios of the products remained constant, 
confirming that the regioselectivity is not 
due to degradation of the alternative product 
5 (22, 24, 25). 

We ascribe this high regioselectivity to 
the modeled docking H-bonding between 
the -COOH groups of ibuprofen and cata- 
lyst 1b. To test this hypothesis further, we 
added acetic acid to the reaction medium, 
expecting that it would largely displace the 
ibuprofen from the recognition site. If so, 


1942 


the catalytic selectivity would then resemble 
that observed in the absence of recognition. 
Indeed, the 4:5 ratio found in this case fell 
precisely to the level observed with the 
control catalyst 1e (Table 1). 

Oxidation of the alkyl carboxylic acid 
substrate 6, using recognition catalyst 1b, 
led not only to regioselective oxygenation 
at the remote tertiary C-H bond, but also to 
diastereoselection of a single isomer of 7: 
the trans isomer 7t (Table 2). Our control 
experiment with catalyst le yielded several 
other oxidation products in addition to 7t 
(22). This diastereoselectivity can be ratio- 
nalized on the basis of a model (Fig. 4) in 
which the stereochemical outcome is deter- 


Fig. 2. Molecular model of catalyst 
1b (Chem 3D) docked with H-bonded 
ibuprofen (substrate). 


A 
aoe 
ALS 
7" COz 
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mined in the “rebound” step. In this model, 
the carbon radical putatively formed by H- 
atom abstraction from the substrate adopts a 
chairlike conformation (26), which leads to 
the observed product isomer 7t on transfer 
of an OH group from the Mn-OH interme- 
diate to the carbon radical center. Transfer 
of OH to the opposite side of the ring, to 
yield the other product isomer, is not pos- 
sible via any plausible intermediate con- 
formations that we could identify from 
modeling work. 

In a control experiment that rules out 
undesirable autoxidative mechanisms in- 
volving atmospheric molecular oxygen, the 
same results were obtained under a nitrogen 


Oy 
> a 
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Fig. 3. (A) Oxidation products of ibuprofen with 1b as catalyst. Bold arrows indicate alternate sites 
of attack. (B) Oxidation products of 6 (6c + 6t) with 1b as catalyst. Solid arrows indicate possible 


sites of attack according to molecular modeling. 
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atmosphere as were obtained in air. Any 
rebound must, therefore, be fast because the 
proposed carbon radical is not trapped by 
ambient molecular oxygen. Oxidation of 
cis-stilbene yielded cis-stilbene epoxide, 
not the trans epoxide, which is usually the 
major product from a freely diffusing 
radical mechanism. This result suggests that 
a freely diffusing radical mechanism is un- 
likely (19, 27). Further mechanistic studies 
are in progress. 

Initially, we obtained low catalytic turn- 
overs (~50) for ibuprofen oxidation using 1% 
catalyst (relative to substrate), probably be- 
cause catalyst degradation halted the reaction. 
To remedy this problem, we lowered the 


catalyst concentration, thereby reducing the 
probability of bimolecular catalyst self- 
oxidation. With a 0.1% catalyst to substrate 
ratio, a much higher total turnover number of 
580 was attained (catalyst 1b, 0°C) with no 
loss of regioselectivity (98.5%) (Table 1). For 
oxidation of 6, we used 0.1% catalyst in all 
experiments. Further improvement of the cata- 
lytic turnover has been attained by replacing 
the CH,CN solvent with more oxidation- 
resistant CD,CN, thereby reducing the com- 
petitive solvent oxidation. Use of CD,CN 
raised total turnover numbers even further 
without adversely affecting the regioselectivity 
(and diastereoselectivity in the case of 6) 
(Tables 1 and 2). 


Table 1. Product distribution from ibuprofen oxidation [by 1H—nuclear magnetic resonance (1H- 


NMR) spectroscopy]. 


4 5 


Temperature Catalyst Conversion (Favored by (Disfavored by Total 
ae - turnovers* 
recognition) recognition) 
20°C 1b 50% 97.5% 2.5% 50 
1c 53% 77% 23% 53 
o°c 1b 53% 98.5% 1.5% 53 
1c 54% 78% 22% 54 
-20°C 1b 53% 98.5% 1.5% 53 
1c 54% 77% 23% 54 
20°C 1bt 56% 75% 25% 56 
1ct 58% 77% 23% 58 
o°c 1b§ 58% 98.5% 1.5% 580 
20°C 1bt§ 71% 96.5% 3.5% 710 


*Total turnovers = mol products per mol catalyst; subtrate:catalyst:oxidant = 100:1:500. 
tWith CD,CN as solvent instead of CH,CN. 


acid (400% with respect to substrate). 
100:0.1:500. 


tSolutions contained excess acetic 
§Subtrate:catalyst:oxidant = 


Table 2. Product distribution from 6 (6c + 6t) (by 1H-NMR spectroscopy). 


Temperature hatalyst Conversions ih is Ouler Total 

P (0.1%) (favored) (disfavored) products turnovers* 
20°C 1b 13% >99% <1% <1% 130 
20°C 1c ~19% ~30% ~30% ~40% 190 
20°C 1bt 18% >99% <1% <1% 180 


*Total turnovers = mol products per mol catalyst; subtrate:catalyst:oxidant = 100:0.1:500. 


of CH,CN. 


{With CD,CN as solvent instead 


Fig. 4. Molecular model of the intermediate, resulting from H-atom abstraction from C6 of 6 ina 
distorted chair conformation, docked to 1b (Chem 3D). 
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The general strategy may have wide appli- 
cation in chemical catalysis. 
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The Competitive Cost of 
Antibiotic Resistance in 
Mycobacterium tuberculosis 
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Mathematical models predict that the future of the multidrug-resistant tuberculosis epidemic will 
depend on the fitness cost of drug resistance. We show that in laboratory-derived mutants of 
Mycobacterium tuberculosis, rifampin resistance is universally associated with a competitive fitness 
cost and that this cost is determined by the specific resistance mutation and strain genetic 
background. In contrast, we demonstrate that prolonged patient treatment can result in multidrug- 
resistant strains with no fitness defect and that strains with low- or no-cost resistance mutations are 


also the most frequent among clinical isolates. 


ntimicrobial resistance has become a 
Avis problem in many pathogens, 

incurring both economic costs and loss 
of human lives (/, 2). Antibiotic resistance is 
also often associated with a reduced competi- 
tive ability against antibiotic-sensitive strains, 
in the absence of the antibiotic (3). In a variety 
of model systems, it has been shown that this 
fitness “cost” depends on the specific drug 
resistance-conferring mutation and the strain 
genetic background and that it can be amelio- 
rated by compensatory mutations (4-9). How- 
ever, there are few studies linking the predictions 
of these model systems to epidemiological data 
from human populations (/0—12). Here, we con- 
nect the in vitro cost of resistance to the prevalence 
of clinically important mutants of Mycobacte- 
rium tuberculosis, the causative agent of human 
tuberculosis. 

The emergence of multidrug-resistant strains 
of M. tuberculosis is threatening global disease 
control efforts (73). Multidrug-resistant tuber- 
culosis (MDRTB) is defined as tuberculosis 
caused by organisms resistant to isoniazid and 
rifampin, the two most potent first-line anti- 
tuberculosis drugs. MDRTB represents a major 
public health burden, requiring prolonged treat- 
ment with more costly and less effective agents 
(13, 14). To date, most investigations of factors 
determining the spread of MDRTB have focused 
on the quality of tuberculosis-control programs 
and patient adherence to treatment. However, 
recent studies suggest that bacterial factors, 
such as the relative fitness of drug-resistant 
M. tuberculosis strains, are also important 
(15-18). M. tuberculosis develops drug resist- 
ance by the sequential acquisition of chromo- 
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somal mutations (/9), but there are only limited 
data on the effect of different drug resistance— 
conferring mutations on the relative fitness of 
M. tuberculosis (12, 20). Furthermore, there has 
been no attempt to investigate the effects of 
different genetic backgrounds on the competi- 
tive fitness of drug-resistant pathogens, includ- 
ing M. tuberculosis. 

The relative competitive fitness of bacteria 
can be quantified experimentally with the use 
of competition assays originally developed for 
Escherichia coli (21). In these experiments, the 
drug-susceptible and the drug-resistant orga- 
nisms compete for limited resources in a com- 
mon environment. In M. tuberculosis, as in other 
bacteria, resistance to rifampin is mediated 
through missense mutations in rpoB, the gene 
encoding the B subunit of the RNA polymerase 
(19). To determine the impact of different ypoB 
mutations on the relative competitive fitness of 
rifampin-resistant M. tuberculosis, we selected 
a panel of spontaneous rifampin-resistant mu- 
tants by growing M. tuberculosis CDC1551 on 
media containing rifampin. CDC1551 is a clini- 
cal strain and part of a lineage of M. tubercu- 
losis associated with Europe and the Americas 
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(22-24), and it is susceptible to currently used 
antituberculosis drugs. We sequenced known 
rifampin-resistance regions in the rpoB gene of 
all of the 52 colonies recovered and found that 
all of them had a missense substitution in rpoB 
(table S1). We chose nine mutants with dif- 
ferent ypoB mutations and had them compete in 
vitro against their rifampin-susceptible ancestor, 
with the use of an assay that included lag, ex- 
ponential, and early stationary phases of growth 
(21). All of the rifampin-resistant mutants had a 
statistically significant fitness cost compared 
with the susceptible ancestor, which is defined 
to have a competitive fitness of 1.0 (Fig. 1 and 
table S2). Among the different ypoB mutants, 
there were significant differences in competi- 
tive fitness, with values ranging from 0.58 to 
0.91. We found a significant effect of muta- 
tion on fitness [analysis of variance (ANOVA), 
F3g = 12.1563, P < 0.0001]. The Ser°3!— 
Leu?! (S531L) mutant exhibited the lowest 
cost, whereas the Arg>?9>GIn*?? (R529Q) mu- 
tant had the greatest fitness cost of all mutants 
(Fig. 1 and table S2). These results confirm 
that a wide range of rpoB mutations can have 
markedly different effects on the competitive 
fitness of M. tuberculosis in vitro, consistent 
with earlier work that examined the competitive 
cost of a small number of rpoB mutations 
(22, 20). 

Next, we investigated the influence of the 
bacterial genetic background on the cost of rif- 
ampin resistance-conferring mutations. In the 
laboratory, we selected a second panel of spon- 
taneous rifampin-resistant mutants in T85, 
another pan-susceptible isolate that is part of a 
distinct lineage of M. tuberculosis strongly as- 
sociated with East Asia, commonly referred to 
as the Beijing strain (24, 25). All 63 rifampin- 
resistant colonies selected in T85 had a non- 
synonymous substitution in rpoB (table S3). 
We measured the relative competitive fitness of 
four different mutants with rpoB mutations whose 
fitness costs had been measured in CDC1551. 
Similar to the results in the CDC1551 back- 
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Fig. 1. Relative competitive fitness of laboratory-derived rifampin-resistant mutants of M. tuberculosis. 
All mutants had a statistically significant fitness cost (error bars indicate 95% confidence intervals). This 
cost was less in rpoB $531L mutants than in other rpoB mutants, irrespective of the strain background. 
Light gray bars, CDC1551 mutants; dark gray bars, T85 mutants. Y, Tyr; W, Trp; P, Pro. 
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ground, all T85 rifampin-resistant mutants had 
a significant fitness cost (Fig. 1 and table S2). 
We tested for an effect of genetic background 
by comparing the fitness costs of the four mu- 
tations found in each of the two backgrounds. 
We found an overall significant effect of ge- 
netic background when nested in mutation 
(nested ANOVA, F,,, = 6.8962, P < 0.0001), 
and, as observed in the CDC1551 background, 
the rpoB S531L mutant had higher competitive 
ability than other mutations in the T85 back- 
ground (Fig. 1 and table S2). The fitness of the 
S531L mutation did not significantly differ in 
the two genetic backgrounds (i.e., the confi- 
dence intervals overlap; Fig. 1 and table S2). 
However, genetic background did significantly 
alter the fitness of the His**°— Asp°?° (H526D) 
mutation (i.e., the confidence intervals do not 
overlap; Fig. 1 and table S2). Taken together, 
our data confirm that the in vitro relative fitness 
of rifampin-resistant MM. tuberculosis depends 
on both the resistance-conferring mutation and 
the strain genetic background. 

The relevance of empirically measured costs 
of laboratory-derived drug resistance to resist- 
ance that emerges in tuberculosis patients has 
not been established. Therefore, we measured 
the relative competitive fitness of rifampin- 
resistance in paired isolates (one that was drug 
resistant and one that was drug susceptible) from 
10 patients who acquired rifampin resistance 
during antibiotic treatment (Fig. 2 and table S4). 
We used a standard DNA fingerprinting 
method to confirm that the paired isolates 
belonged to the same bacterial clone (26). In 
all 10 patients, both isolates had identical DNA 
fingerprinting patterns (table S5), which 
supports the assertion that the rifampin-resistant 
isolates recovered later were indeed derived 
from the corresponding rifampin-sensitive iso- 
lates. However, DNA fingerprinting methods 
are limited and may not reveal small genetic 
differences which could have been present 
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between isolates. All of the clones were distinct 
but belonged to one of the two strain lineages 
represented by CDC1551 and T85 (tables S4 
and S5). Five of the resistant isolates harbored 
the rpoB $531L mutation and five had other 
rpoB mutations (Fig. 2, table S4). 

All of the clinical isolates with mutations 
other than S531L had a relative fitness of less 
than 1.0, although the measured relative fitness 
was not always the same as that observed in the 
mutants selected in vitro (Fig. 2 and table S4). 
In contrast, four of the five clinical strains with 
the rpoB $531L mutation had a relative fitness 
of greater than or equal to 1.0. Notably, these 
four strains were the only rifampin-resistant 
mutants that had no fitness cost compared with 
their rifampin-susceptible ancestor. 

Two alternative explanations could account 
for the high fitness we observed in clinical strains 
harboring the 7poB S531L mutation. First, this 
mutation may indeed be a no-cost (versus just a 
low-cost) mutation in these particular strains. 
Second, given that sufficient time had passed 
between the isolation of the susceptible and 
resistant members of each pair (table S4), com- 
pensatory mutations could have arisen and 
ameliorated the low initial fitness cost associated 
with this mutation, similar to what has been 
observed in other species of bacteria (4, 5, 8). In 
support of this possibility, we found that the 
clinically derived rpoB S531L mutants had a 
mean competitive fitness of 1.04 (95% confi- 
dence interval: 1.00 to 1.08). This was statistically 
significantly higher than the laboratory-derived 
rpoB §531L mutants, which had a mean com- 
petitive fitness of 0.93 (95% confidence interval: 
0.90 to 0.96). Given that all clinical ancestors of 
the rpoB S531L mutants were already resistant 
to isoniazid or isoniazid and streptomycin (table 
S4), this increase in fitness could be due to 
amelioration of fitness costs associated with the 
other resistance mutations rather than specifi- 
cally the rifampin-resistance mutation. In addi- 


0 + T T T Tr 


5 


10 


6 7 8 9 


isolate pair 


Fig. 2. Relative competitive fitness of clinically derived rifampin-resistant mutants of M. tuberculosis. 
Four of the five mutants with the rpoB $531L mutation (light gray bars) had no fitness cost compared 
with their rifampin-susceptible ancestors. All mutants with other rpoB mutations (dark gray bars) had 
significant fitness defects (error bars indicate 95% confidence intervals). 
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tion, general increases in fitness could have 
occurred during long-term infection, rather than 
specific amelioration of the cost associated with 
a resistance mutation. We cannot differentiate 
between these alternative hypotheses with our 
current data. Nevertheless, our results clearly 
demonstrate that clinical strains of multidrug- 
resistant M. tuberculosis can have a significant- 
ly higher fitness than their progenitors, either 
because of no-cost resistance mutations, com- 
pensatory evolution, or both. 

We hypothesized that those drug-resistant 
strains with the least costly and potentially most 
easily compensated resistance mutation will be 
more likely to spread and become prevalent in 
human populations. This scenario is particularly 
likely given the recent observation that many 
patients harbor subpopulations of the same myco- 
bacterial clone with different drug resistance— 
conferring mutations (27). Many studies have 
reported the frequency of different drug-resistance 
alleles in a variety of clinical settings (28), pro- 
viding an opportunity to test our hypothesis. 
When we compared the relative fitness of the 
different ypoB mutants to their clinical frequen- 
cy, an association became evident (Figs. 1 and 
2 and tables S2 and S4). The rpoB S531L mu- 
tation, which exhibited the lowest fitness cost 
in laboratory-derived mutants and no fitness 
cost in clinical strains, is the most prevalent 
rifampin resistance—conferring mutation, over- 
all accounting for 54% of rifampin-resistant 
isolates, even in parts of the world associated 
with phylogenetically distinct strain lineages 
(24, 28). In contrast, the rpoB R529Q mutant, 
which carried the highest fitness cost of all 
mutants, has never been observed in clinical 
settings (28). Taken together, these findings are 
consistent with the idea that drug-resistant 
strains harboring low- or no-cost mutations 
such as rpoB §531L are selected in patients 
during treatment and that such strains are more 
likely to spread in human populations. 

Several mathematical models have been 
developed to predict the future of MDRTB 
epidemics. Some have assumed a universal 
fitness cost to drug resistance and concluded 
that MDRTB will remain a localized problem 
(15). More recent mathematical models have 
allowed for variable fitness among strains and 
have come to very different conclusions de- 
pending on the specific fitness assumed (/7). 
Our work supports the more complex models 
and suggests that the heterogeneity in fitness is 
a function of the drug-resistance mutation, the 
strain genetic background, and potentially com- 
pensatory evolution as well. These more com- 
plex models suggest that resistance will not 
spread below a relative fitness of 0.7. Interest- 
ingly, all the rpoB mutants tested had a relative 
fitness >0.7, except for the one that has not 
been observed in clinical settings (Figs. 1 and 2). 
None of these models has allowed for changing 
fitness through time, either as a result of com- 
pensatory evolution or other increases in fitness 
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due to within-host evolution. Future mathemat- 
ical treatments of the MDRTB epidemic should 
include such changes. 

Our work suggests that in vitro competition 
assays can be predictive of biological dif- 
ferences important to M. tuberculosis ecology 
and evolution. Our finding that genetic 
background is a factor in the measured cost 
of resistance suggests that the cost of a given 
mutation may vary from location to location 
worldwide owing to the phylogeography of 
this pathogen (24). The possibility that fitness 
changes can occur during infection of a single 
patient should be taken into account when 
applying evolutionary ecology to infectious 
disease treatments or to the prediction of 
epidemics. The availability of such predictive 
methodology permits the incorporation of 
fitness considerations into the design and 
deployment of new drug regimens for control- 
ling tuberculosis. 
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Lhx2 Maintains Stem Cell 
Character in Hair Follicles 


Horace Rhee, Lisa Polak, Elaine Fuchs* 


During embryogenesis, stem cells are set aside to fuel the postnatal hair cycle and repair the 
epidermis after injury. To define how hair follicle stem cells are specified and maintained in an 
undifferentiated state, we developed a strategy to isolate and transcriptionally profile embryonic 
hair progenitors in mice. We identified Lhx2 as a transcription factor positioned downstream of 
signals necessary to specify hair follicle stem cells, but upstream from signals required to drive 
activated stem cells to terminally differentiate. Using gain- and loss-of-function studies, we 
uncovered a role for Lhx2 in maintaining the growth and undifferentiated properties of hair 


follicle progenitors. 


air follicle morphogenesis involves a 
Hewes series of reciprocal inter- 

actions between the ectoderm and its 
underlying mesenchyme (fig. S1) (/—3). In re- 
sponse to an inductive Wnt and an inhibitory 
Bmp signal (Noggin), small hair placodes bud 
from the epithelium, giving rise to larger hair 
germs (4-7). In the presence of the mitogen 
Shh, these hair germs develop further and grow 
downward to form a mature follicle that ac- 
tively produces hair (8-10). Although the mo- 
lecular details of bud formation are still poorly 
defined, the general features of this process are 
repeated at the start of each postnatal hair cycle 
when multipotent stem cells in the hair follicle 
bulge become activated to initiate a new round 
of hair growth. In addition, the early epithelial 
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remodeling to form the hair germ shares many 
features with the development of other epithe- 
lial tissues and organs, including feathers, teeth, 
and mammary glands (//—/3). Understanding 
how tissues form buds that then progress along 
different lineages is predicated on elucidating 
the molecular mechanisms that funnel these 
early signaling pathways into a transcriptional 
program that drives morphogenesis. 

To examine the genetic changes that occur 
during epithelial bud formation, we developed a 
strategy to isolate embryonic hair progenitors. 
To this end, we generated mice doubly trans- 
genic for a Keratin 14-GFP gene expressed in 
skin keratinocytes and the Wnt reporter gene 
TOPGAL, transcribed in hair placodes and 
germs where #-catenin/Lefl complexes are 
active (4, /4). In these early hair progenitors, 
E-cadherin is down-regulated and P-cadherin 
is up-regulated (Fig. 1A) (7). By embryonic 
day 17 (E17), we could use dispase to separate 
the epidermis, including hair placodes and 


germs, from the underlying dermis, which har- 
bored more mature hair pegs and follicles (fig. 
S2). With the use of fluorescence-activated cell 
sorting (FACS) on the epidermal fraction, the 
early “PCAD+” hair progenitors (K14-GFP-+, 
o6-integrin+, P-cadherin+) were then sepa- 
rated from the “PCAD-—” interfollicular epider- 
mis (K14-GFP-+, o6-integrin+, P-cadherin—) on 
the basis of their differential surface P-cadherin 
expression (fig. S3). Characterization of these 
two cell populations confirmed that they had 
similarities in the expression of K5 and B4- 
integrin but distinct activities of TOPGAL and 
the expression of known hair-placode markers 
(Fig. 1, B to D, and fig. $4). 

The gene expression profiles of purified 
PCAD-+ hair progenitors and PCAD- interfol- 
licular basal keratinocytes were further analyzed 
using oligonucleotide microarrays. Using fold 
differences of known hair-placode markers as 
a sensitivity gauge, a twofold cutoff was as- 
signed as a genuine difference between the two 
populations. A total of 1394 probes (660 in 
PCAD-+ and 734 in PCAD-) were preferential- 
ly expressed in one population relative to the 
other (table S1). 

A short list of differentially expressed genes 
that are relevant to the present study is provided 
in table S2. As anticipated, a number of these 
genes have documented roles in either hair 
morphogenesis (PCAD+) or epidermal differ- 
entiation (PCAD-). The interfollicular epidermal 
population was typified by adhesive and cyto- 
skeletal components, Notch signaling factors, 
c-Myc, Kruppel-like factors, and Bmp-responsive 
transcription factors (Grainyhead-like and Ovol) 
previously implicated in epidermal differentia- 
tion (/5—/8). In contrast, the hair germ signature 
featured Wnts, Shh, Bmps, transforming growth 
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factor—B’s, and tyrosine kinase receptor signal- 
ing morphogens, as well as a number of different 
transcription factors. Although some of these 
transcription factors have not been previously 
implicated in the specification of skin progenitor 
fates, others have previously been associated with 
postnatal genetic hair disorders, including Cutll, 
Glil, Hoxc13, Sox9, Trpsl, and Vdr (2, 3). 
Several of the uncharacterized transcription 
factors on this list were also found to be differ- 
entially expressed in the postnatal hair follicle 


Fig. 1. Isolation of embryonic hair 
follicle progenitors. (A) P-cadherin is 
up-regulated at sites of hair follicle 
morphogenesis. This differential ex- 
pression was used to isolate PCAD+ 
hair progenitors and PCAD- inter- 
follicular basal cells by FACS (figs. 
$2 and $3). Scale bar, 40 jum. (B) 
Summary of cytospin analyses for 
epidermal markers K1, K5, and B4- 


bulge (J9, 20) (table S2), suggesting that the 
embryonic hair germ may exhibit functional 
properties similar to those of adult stem cells. 
Although the hair germ is committed to a 
follicular cell fate, it remains undifferentiated 
like bulge stem cells, yet capable of differen- 
tiating into all the lineages of the hair follicle, 
including the sebaceous gland (2/, 22). 

To explore the possibility that the early hair 
germs may reflect hair follicle stem cells and 
regulate key steps in progenitor cell differentia- 
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Fig. 2. Lhx2 is expressed in early 
hair progenitors and postnatal stem 
cells. (A to D) Back skin sections 
from mice at indicated ages were 
stained with antibodies as color- 
coded and counterstained with 
DAPI (blue). Lhx2 is expressed in 
cells at the leading front of in- 
vaginating follicles and in the post- 
natal bulge compartment, but is 
diminished in mature proliferative 
hair progenitors (matrix). Abbrevia- 
tions are as follows: Peg, hair peg; 
Bu, presumptive bulge; Mx, matrix 
cells; DC, dermal condensate; DP, 
dermal papilla; 2°HG, secondary 
hair germ emerging at the start of 
the postnatal hair cycle. Scale bars, 
20 um. 
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tion, we focused on transcription factors emanat- 
ing from our screen that are known to govern 
developmental cell fate specification in other 
tissues and organs. Lim-homeodomain tran- 
scription factor Lhx2 was particularly interesting 
because LAx2 null mutant animals display 
defects in patterning and cell fate determination 
during brain development (23-25). In addition, 
they lack definitive erythropoeisis and converse- 
ly, hematopoetic progenitor cells can be main- 
tained in vitro by forced expression of Lhx2 
(26). Lhx2 null animals die between E15.5 and 
E16.5, and a possible role for Lhx2 in skin has 
not been examined. 

Lhx2 was up-regulated 18-fold in the 
PCAD+ population relative to the PCAD-— 
population, as determined by microarray anal- 
ysis. Semi-quantitative reverse transcription 
polymerase chain reaction (RT-PCR) and in 
situ hybridization confirmed this marked dif- 
ferential expression (fig. S5). When examined 
by immunofluorescence, Lhx2 first appeared in 
early hair placodes, and as morphogenesis pro- 
gressed, became prominent at the leading front 
of invaginating hair germs and pegs (Fig. 2A). 
As downgrowth neared completion and hair 
differentiation began, Lhx2 concentrated in the 
upper outer root sheath (ORS) at a presumptive 
site (bulge) of the developing postnatal follicle 
stem cell compartment (Fig. 2A and fig. S1). 
Concomitantly, expression diminished at the 
base of the follicle, where highly proliferative 
matrix cells give rise to the differentiating inner 
root sheath and hair shaft (Fig. 2B). In adult 
follicles, Lhx2 concentrated in the bulge, and as 
the new hair cycle began, Lhx2 extended to the 
emerging secondary hair germs (Fig. 2, C and 
D). Based on these patterns, we posit that Lhx2 
functions in specifying the embryonic hair fol- 
licle progenitor cells that then persist as bulge 
stem cells in adult follicles. 

To more precisely define Lhx2’s role in 
hair follicle stem cell specification and/or 
maintenance, we examined its status in various 
genetic mutant embryos that are defective in 
different aspects of hair morphogenesis. In the 
complete absence of hair follicle induction or 
bulge maintenance, as reflected in B-catenin 
conditionally null [conditional knockout 
(cKO)] skin, Lhx2 was not expressed (Fig. 3B). 
In Shh knockout embryos, where hair germs 
are specified but unable to progress, Lhx2 
expression was dramatically reduced (Fig. 3C). 
This positioned Lhx2 downstream of Wnt and 
Shh, where it could play a role in establishing 
or expanding the early progenitors necessary 
for hair follicle morphogenesis. 

Bmp signaling is not required for hair follicle 
induction, even though Bmp ligands and recep- 
tors are expressed in embryonic hair germs and in 
postnatal follicle stem cells. Correspondingly, in 
BmpR1a cKO skin, Lhx2 was expressed in both 
embryonic hair germs and the presumptive 
bulge of developing follicles (Fig. 3, D and E). 
Conversely, Bmp signaling is required for dif- 
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ferentiation, and in the absence of BmpR1a, pro- 
liferating undifferentiated hair progenitor cells 
accumulate at the follicle base (27, 28). Lhx2 
was noticeably enhanced in these follicles, with 
strong staining throughout the ORS and matrix 
(Fig. 3, E and F). These cells were also positive 
for Shh and Lefl. Thus, in the absence of ter- 
minal hair differentiation, cells accumulating in 
postnatal BmpR/a null follicles resembled early 
embryonic hair follicle progenitors. 

If Lhx2 governs the gene expression pro- 
gram of undifferentiated follicle stem cells or 
their early progenitors, then misexpression of 
Lhx2 in interfollicular epidermis might result in 
an induction of hair follicle progenitor genes. 
To test this possibility, we engineered K14-Lhx2 


Fig. 3. Lhx2 functions downstream 
of follicle stem cell specification but 
upstream of their differentiation. (A) 
Lhx2 is expressed in P-cadherin— 
positive hair germs. Shown is a rep- 
resentative wild-type littermate at 
E17.5. (B) Lhx2 is not expressed in 
the absence of hair follicle induc- 
tion, as reflected in the B-catenin 
null skin epithelium (5). K14-Cre 
was used to conditionally target 
B-catenin. (C) Lhx2 is reduced in 
Shh null hair germs. Follicles are 
specified in the absence of Shh but 
fail to progress further (8, 20). (D) 
Lhx2 is expressed in BmpR1a con- 
ditional null hair germs. (E and F) 
In addition to expression in the 
presumptive bulge, Lhx2 persists in 
the lower ORS and matrix of neo- 
natal and adult BmpR1a null fol- 
licles. Unable to undergo terminal 
differentiation, these BmpR1a null, 


transgenic mice. Although more hair follicles 
were not induced, Lhx2 markedly suppressed 
morphological and biochemical signs of epider- 
mal differentiation and failed to produce a func- 
tional lipid barrier (Fig. 4A). Most notable was 
the induction of Tcf3 and Sox9 (Fig. 4B), two 
key transcription factors of adult hair follicle 
stem cells (29, 30). Lhx2 also suppressed differ- 
entiation in tongue epithelium (fig. S6). These 
findings suggest that Lhx2 can maintain cells in 
an undifferentiated state, reinforcing the link 
between Lhx2 and stemness. 

If Lhx2 is required for follicle stem cell 
maintenance, then its absence could alter the 
ability of hair follicles to form. In support of 
this notion, E16 LAx2 null embryos displayed 


B-catenin cKO 


BmpRia cKO 


Lhx2-expressing cells appear to be undifferentiated follicle stem cells (27, 28). Scale bars, 40 jum. 


Fig. 4. Gain- and loss-of-function A 
studies reveal a role for Lhx2 in 
promoting follicle stem cell mainte- 
nance. (A) Transgenic expression of 
Lhx2 in the interfollicular epidermis 
suppresses terminal differentiation. 
Morphological and biochemical 
features of spinous, granular, and 
stratum corneum stages of epider- 


mal differentiation are diminished SO SA oa 


in d8 (day 8 after graft) skin from B 
K14-Lhx2 transgenic mice (TG) 
relative to wild-type littermates. 
Patches outlined in black denote 
skins grafted onto Nude mice and 
subjected to a B-galactosidase sub- 
strate exclusion assay to test for an 
intact epidermal barrier. The back- 
ground absorption of dye in the sur- 
rounding Nude skin arises from TG 
defective hair follicle orifices. (B) 

Immunofluorescence reveals an in- 64 
duction of follicle stem cell markers 


WT 


in d4 K14-Lhx2 epidermis. (C) Loss of Lhx2 reduces hair morphogenesis. Repre- 
sentative skin sections from E16 littermates show comparable epidermal differ- 


TG og cat ts 


B4 


an ~40% reduction in the overall density of 
P-cadherin—positive hair follicles (Fig. 4C), with 
no noticeable defect in the epidermis or em- 
bryo size. A marked reduction in follicle den- 
sity is a feature of other mouse mutants in key 
hair follicle morphogenetic genes. Although 
Lhx2 KO follicle density was reduced, Shh, 
Wntl0b, Bmp2, Bmp4, and Lefl expression 
appeared unaffected in those hair placodes and 
germs that developed (fig. S7). In Lhx2 null skin 
engraftments, follicles appeared morphologi- 
cally and biochemically indistinguishable from 
their wild-type counterparts (fig. S8). Taken to- 
gether, the gain- and loss-of-function studies 
suggest that Lhx2, reflecting its expression pat- 
tern, functions to specify and maintain hair 
follicle stem cells but does not function in their 
differentiation. 

If Lhx2 maintains the undifferentiated state 
of embryonic and adult follicle stem cells, then 
Lhx2 null follicles might exhibit alterations in 
the transition of stem cells from the resting 
(telogen) to the growing (anagen) phase of the 
postnatal hair cycle. Using skin grafts, we 
compared the hair cycles of wild-type and 
Lhx2 KO follicles. The initial morphogenetic 
and first postnatal Lhx2 KO hair cycles 
progressed similarly to those in the wild type, 
and by 8 weeks, KO follicles had returned to 
telogen (fig. S9). By contrast, at 11 weeks, 
when most wild-type follicles were still in this 
extended telogen, KO follicles had precocious- 
ly entered the next hair cycle (Fig. 5A). 
Moreover, after being shaved at 8 weeks, most 
wild-type hairs remained in telogen, whereas 
KO hairs consistently and uniformly grew back 
within 3 weeks, confirming their shortened 
resting phase (Fig. 5B). 

Immunofluorescence and FACS analyses 
revealed that KO follicles exhibited diminished 


Filaggrin 


Hair Follicle Density 


: 
4 


follicles/field 


Pc HG Peg total 
hair follicle stage 


entiation but fewer follicles at all stages in Lhx2 null embryos. The graph provides 
quantification from multiple sections of three embryos. Scale bars, 40 tm. 
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expression of CD34, a surface marker of bulge 
stem cells (Fig. 5C) (/9). This reduction in 
CD34 was observed irrespective of hair cycle 
number or stage. Other stem cell markers that 
we examined were comparably expressed in 
wild-type and KO bulges (fig. S10). 

Although CD34 marks adult stem cells, it is 
not found in embryonic skin progenitors, sug- 
gesting that its reduction could be an indication 
of enhanced proliferative activity within KO 
follicle stem cells. This hypothesis was sup- 
ported by bromodeoxyuridine (BrdU) pulse- 
chase experiments conducted before marked 
deviations in hair cycling (Fig. 5D). Only the 
wild-type follicle bulge compartment retained 
appreciable BrdU label administered at the onset 
of anagen and chased for 4 weeks (/9, 37). By 
contrast, KO hair follicles displayed very few 
label-retaining cells (LRCs), as confirmed and 
quantified by flow cytometry. 

The reduction in label retention was accom- 
panied by enhanced proliferation within the KO 
bulge. After a 4-hour BrdU pulse during full 
anagen, the percentage of S-phase labeled bulge 
cells was twice as high as normal (Fig. 5E and 
fig. S11). By contrast, the number of S-phase 
cells in the interfollicular epidermis/ORS of 


Fig. 5. Lhx2 maintains follicle stem A 
cells in a quiescent, inactive state. (A) 
Histology of wild-type and Lhx2 KO 
skins at 11 weeks after graft. Wild- 
type follicles were still in telogen, 
but KO follicles prematurely entered 
anagen. (B) Upon shaving at 8 
weeks, wild-type hairs did not grow 
back, confirming their resting state. 
In contrast, KO hairs grew back 
within 3 weeks. A small portion (*) 
of the wild-type graft grew hairs 
because of a wound. (C) CD34 ex- 
pression is dramatically reduced in 
KO follicle bulges, irrespective of 
whether they are in telogen or 
anagen. Shown are representative 
immunofluorescence images and 
FACS profiles of telogen follicles at 
8 weeks, and CD34 quantification by 
flow cytometry in telogen and 


wild-type and KO skins was comparable, 
underscoring the specificity of this hyperprolif- 
eration. The elevated proliferative activity of the 
KO bulge did not appear to alter the overall size 
of the stem cell niche. We conclude that without 
Lhx2, follicle stem cells are more readily ac- 
tivated to proliferate and differentiate along 
the hair lineage. On the other hand, Lhx2 is not 
sufficient to induce quiescence, because trans- 
genic expression did not suppress proliferation 
or induce CD34 in the skin epithelium. 

Our ability to isolate and transcriptionally 
profile embryonic hair placodes and interfol- 
licular epidermis has enabled us to substantiate 
genes previously implicated in hair and epi- 
dermal development and to uncover differ- 
ences that could be important in orchestrating 
lineage specification of multipotent skin pro- 
genitors. Lhx2 has served as a paradigm for 
testing this premise, and our studies reveal that 
it functions as a molecular brake in regulating 
the switch between hair follicle stem cell 
maintenance and activation. Although follicles 
can be specified embryonically without Lhx2, 
their overall numbers are reduced, and Lhx2 
null follicles that do form are not proficient in 
maintaining the resting state and precociously 


anagen follicles during the first post- 


natal cycle. (D) Loss of BrdU label 
retention in KO follicles. After a 3- 
day BrdU pulse on days 26 to 28 at 
the onset of anagen in both wild- 
type and KO skin grafts, and a 4- 
week chase when follicles had 
entered telogen (fig. 59), LRCs con- 
centrated in the infrequently divid- 
ing bulge stem cells of wild-type 
follicles, but LRCs were diminished in 
Lhx2 KO skin. Shown are skin 
sections stained with antibody to 
BrdU and results of quantification 


telogen 
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by flow cytometry. (E) Increased BrdU incorporation by KO follicle stem cells. After a 4-hour BrdU pulse at day 
40, when wild-type and KO follicles were in mid-anagen of their first postnatal hair cycle (fig. $9), cells were 
isolated and «6-integrin—-expressing S-phase cells were quantified by flow cytometry. Scale bars, 40 tum. 
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activate. Once committed, cells no longer 
require or express Lhx2 and progress along a 
normal program of terminal differentiation 
(fig. S12). Finally, Lhx2 is the first identified 
marker expressed specifically by both embry- 
onic hair placodes and postnatal follicle stem 
cells of the bulge. Lhx2 now provides a means 
to dissect the transcriptional mechanisms that 
underlie stem cell maintenance within the hair 
follicle. 
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Glutaminyl-transfer RNA (Gln-tRNAS"") in archaea is synthesized in a pretranslational amidation 
of misacylated Glu-tRNAS" by the heterodimeric Glu-tRNA©' amidotransferase GatDE. Here we 
report the crystal structure of the Methanothermobacter thermautotrophicus GatDE complexed to 
tRNA&"" at 3.15 angstroms resolution. Biochemical analysis of GatDE and of tRNA®!" mutants 
characterized the catalytic centers for the enzyme’s three reactions (glutaminase, kinase, and 
amidotransferase activity). A 40 angstrom—long channel for ammonia transport connects the active 
sites in GatD and GatE. tRNAS" recognition by indirect readout based on shape complementarity 
of the D loop suggests an early anticodon-independent RNA-based mechanism for adding 


glutamine to the genetic code. 


he formation of at least 20 sets of 
| aminoacyl-transfer RNA (aa-tRNA) is 
a requirement for protein biosynthesis and 
its accuracy. In most cases, aa-tRNA synthesis is 
achieved by direct acylation of tRNA with the 
cognate amino acid catalyzed by the aminoacyl- 
tRNA synthetases (aaRSs) (7). However, a num- 
ber of aa-tRNAs are made by pretranslational 
tRNA-dependent amino acid modification pro- 
cesses. These include selenocysteinyl-tRNA (2), 
cysteinyl-tRNA in methanogenic archaea (3), and 
amide aa-tRNAs (Asn-tRNA and Gln-tRNA) in 
most prokaryotes (4). These pretranslational 
amino acid modifications represent essential, 
older pathways of aa-tRNA synthesis (5-8) and 
are probably the routes by which these amino 
acids first entered the genetic code (2). Lateral 
gene transfer of an aaRS may have replaced 
these indirect pathways in some organisms (9). 
These pathways also provide the sole route for 
the supply of asparagine in many bacteria (/0) 
and of cysteine in methanogens (3). 

Given the mechanistic unity of the synthesis 
of most aa-tRNAs, it was surprising to discover 
the lack of conservation in Gln-tRNA formation 
(11). The eukaryotic cytoplasm uses direct acyla- 
tion of tRNAG by glutaminyl-tRNA synthetase 
(GlnRS), whereas most bacteria and all archaea 
use a tRNA-dependent transamidation route (//). 
The first step in this route for Gln-tRNA synthesis 
is the formation of misacylated Glu-tRNA®&" by a 
“nondiscriminating” (ND) glutamyl-tRNA syn- 
thetase (GluRS), an aaRS that is also responsible 
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for the formation of Glu-tRNAS™ (12). This 
mischarged tRNA is then converted by amida- 
tion to the properly charged Gln-tRNAS" by a 
Glu-tRNAG" amidotransferase (//, /3). 
Nature has two Glu-tRNAG" amidotrans- 
ferases; archaea employ the heterodimeric GatDE 
enzyme (//), whereas bacteria use heterotrimeric 
GatCAB protein (/3). Each enzyme has one 


Fig. 1. Overall structure 
of the 1:1 complex of 
GatDE and tRNAS'", 
(molecules B, D, and F). 
The protein domains are 
colored differently; in 
GatD, the N-terminal 
domain, AnsA-like do- 
main 1, and AnsA-like 
domain 2 are shown in 
light green, blue, and 
red, respectively; where- 
as in GatE, the AspRS- 
like insertion domain, 
cradle domain, helical 
domain, and Yqey-like 
tail domain are colored 
violet, cyan, orange, and 
pink, respectively. In the 


Cradle domain 
(GatE) 


N-terminal psn 


subunit (GatA or GatD) for ammonia production, 
yet they are structurally different; GatA is homol- 
ogous to amidases (/3) and GatD is similar to a 
type I L-asparaginase (AnsA) (//). The other 
crucial amidotransferase subunit (GatB or GatE) 
belongs to the Pet112 protein family (74), which 
consists only of highly related GatB/GatE pro- 
teins. They catalyze two reactions: (i) the phos- 
phorylation of the y-carboxyl group of glutamate 
in Glu-tRNAS" by a kinase activity (/5), and (ii) 
conversion of the tRNA-bound phosphogluta- 
mate to glutamine by an amidotransferase ac- 
tivity (15, 16) using ammonia generated by GatA/ 
GatD. These amidotransferases may be exam- 
ples of how amino acid—metabolizing enzymes 
were recruited during evolution to aa-tRNA 
formation and thus protein synthesis (4). 
Recently, the crystal structure of the Pyro- 
coccus abyssi GatDE apoenzyme was reported 
(17). It revealed that the GatD and GatE subunits 
form an o.,8, tetramer, and [based on earlier 
biochemical experiments (/5)] residues of the 
glutaminase catalytic site were suggested to be 
located in GatD. This left unresolved the structural 
and biochemical definition of all three active sites 
and their interconnection, as well as the nature and 
specificity of binding the tRNA substrate. Here we 
report the crystal structure of the Methanothermo- 
bacter thermautotrophicus GatDE complexed to 


AnsA-like domain 2 
(GatD) ty 


> ~ < 
=, Y, TEAR 
fs ¥ . Be AnsA-like domain1 
(GatD) 


AspRS-like 


resent structure, the insertion domain 
Yaey.tike tail domain is ~~ at) (GatE) 
shown as translucent be- 
cause its side chains are Anticodon arm 
disordered, despite the 
fact that the main chain tRNA 
was traced in the electron — 
density map. The bound 5 
tRNA molecules are yel- N-terminal AnsA-like AnsA-like 
low. In GatE, His?, domain domain 1 domain 2 
Glu’, and Glu2®, which a - ne 8 ey 
coordinate to an essen- 
tial Mg2+ ion (red), are Cradle domain <oee. Eevee! Tail domain 


shown to highlight the 


Glu-tRNA©" kinase and amidotransferase sites. Gln24°, which recognizes A73 (red), and Asp*63, which 
recognizes G52 (red), are indicated. All figures of the molecular models were prepared with the 


program CueMol (www.cuemolL.org/). 
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tRNA" at 3.15 A resolution. Combined with 
biochemical data, this structure reveals a mech- 
anism of concerted action of the three enzymatic 


activities regulated by GlutRNAS". The struc- 
tural implications of tRNAG” recognition by 
indirect RNA readout based on shape comple- 


A 


D tRNAC" 


GatDE 


tRNAS” tRNA**"G1-C72 TGlu DGlu DAsn U19A A20U 


DGlu 
3.8 


DAsn U19A  A20U 
0.9 0.6 4.5 


tRNA&"" tRNAS” tRNAA™G1-C72 TGlu 


Tansamidase 
Activity (9) 


100 ND ND* 18 68.2 


Fig. 2. tRNA recognition. (A) Recognition of the tRNA acceptor end. (B) Recognition of the tRNA 
TYC stem. (C) Recognition of the tRNA D-loopsT'¥C-loop associating region (in stereo). In (A) to 
(CQ), F, — F, omit maps of the tRNA nucleotides contoured at 5.0, 3.0, and 3.0o, respectively, are 
shown. (D) Gel retardation assay and transamidase activity of tRNAS'", mutants. (Left to right) 
Wild-type tRNA", only; wild-type tRNAS', with GatDE (molecular ratios of 1:0.5, 1:2, and 1:8, 
respectively); tRNAS” without and with GatDE; tRNA‘" without and with GatDE; tRNAS'",, .,, 
without and with GatDE; tRNAS'",..., without and with GatDE; tRNAS'",..,, without and with GatDE; 
tRNAS'",,,.,, without and with GatDE; tRNA®'",,,,, without and with GatDE; and tRNAS"",,,,, without 
and with GatDE. Transamidase activities for the various tRNA species were also tested. The 
percentages shown are relative to the activity using wild-type Glu-tRNAS". ND means no detectable 
activity. The asterisk indicates that the substrate for transamidation was Asp-tRNA‘"", 
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mentarity of the D loop suggest an early 
anticodon-independent RNA-based mechanism 
for adding glutamine to the genetic code. 

As in the P. abyssi GatDE apoenzyme (/7), 
the GatD subunit consists of three domains: the 
N-terminal domain, the AnsA-like domain 1, and 
the AnsA-like domain 2 (Fig. 1). The GatE sub- 
unit is made up of a cradle domain, an AspRS-like 
insertion domain of 123 amino acids, a helical 
domain, and the C-terminal tail domain (residues 
562 to 616) (Fig. 1). This tail domain shares con- 
siderable sequence similarity with the C-terminal 
half of Bacillus subtilis YqeY, a protein of un- 
known function (/8). Its topology [Protein Data 
Bank (PDB) identification number 1 NG6] could 
be fitted into the electron density of the GatE tail 
domain, and the main chain could be traced into 
a three-helix bundle, with the side chains dis- 
ordered. The orientation of the GatE helical and 
tail domains in the tRNA complex is different 
from that in the apoenzyme structure to allow 
interaction with tRNAG" (figs. S1 and $2). 

tRNAG makes contact only with the GatE 
subunit, and the tRNA anticodon does not interact 
with the protein (Fig. 1). This was unexpected, 
because anticodon recognition, as in aaRSs (/9), 
would be sufficient for discrimination between 
tRNAS" and tRNAS". GatE enfolds the acceptor- 
TYWC helix of the tRNA molecule (Fig. 1), as the 
CCA-adding RNA polymerase does (20, 2/). 
The helical domain and the C-terminal tail do- 
main form a concave surface, which snugly fits 
the tRNA elbow region comprising the T¥C arm 
and the D loop (Fig. 1). The helical domain inter- 
acts with the minor groove of the T‘¥C stem, 
whereas the C-terminal tail domain invades the 
T¥C-loop*D-loop assembling region. The heli- 
cal domain folds into a superhelical architecture 
that is structurally similar to the body domain of 
the class-II CCA-adding polymerase (2/, 22); 
these enzymes also recognize the tRNA TC 
arm in a shape-complementary manner (2/). 

GatDE accurately discriminates tRNAS™ 
from tRNAS" and tRNA“, and amidates only 
the misacylated Glu-tRNAS (//). In contrast, 
both Glu-tRNAG and Asp-tRNA“*" are sub- 
strates for GatCAB (23). Because GatB lacks 
the AspRS-like insertion domain found in 
GatE, it was suggested that this GatE domain 
may be an antideterminant against tRNA“ 
(17). However, in the GatDE*RNA©" structure, 
the AspRS-like insertion domain does not con- 
tact the tRNA (Fig. 1), and we suggest a dif- 
ferent role for this domain (see below). 

The tRNAS" ACCA-terminus (positions 73 to 
76) enters the central cleft of the GatE cradle do- 
main. At its entrance, the amide group of Gln?” 
forms a bifurcated hydrogen bond with N3 (3.3 A) 
of the adenine ring and the ribose 2’-OH (2.5 A) of 
the discriminator base A73 (Fig. 2A). This base is 
conserved in the M. thermautotrophicus tRNAS", 
tRNAG", and tRNA“ species (fig. S3). The 
mutation of Gln?*° to Ala resulted in a 2.1-fold 
loss of the amidotransferase activity (Fig. 3C). 
Furthermore, Lys*°! in the cradle domain and 
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Arg*°3 in the helical domain provide electrostatic 
interactions with the phosphate backbones of A73 
and C74 of tRNA®", respectively (Fig. 2A). The 
replacement of Arg°®? with Ala reduced the 
amidotransferase activity by 1.7 fold (Fig. 3C). 

The helical domain interacts with the minor 
groove of the tRNAS'" TC stem (Fig. 1). The 
Tyr® hydroxyl group hydrogen-bonds (2.6 A) 
to the 2’-hydroxyl of G53 in the T¥C stem, 
whereas the 5-amino group of Gln*®” and the 
€-amino group of Arg*7! hydrogen-bond with the 
2’-OH of C62 (Fig. 2B). Arg5°3 electrostatically 
interacts (3.6 A) with the backbone phosphate 
of C62 (Fig. 2B). Base-specific recognition is 
observed for G52, whose 2-amino group 
hydrogen-bonds to the B-carboxyl group of 
Asp*3 (Fig. 2B). Changing this residue to Ala 
reduced amidotransferase activity by 1.5 fold, 
and replacement of Arg*°? with Ala lowered 
the transamidation activity by 1.7 fold (Fig. 3C). 
However, G52, G53, and C62 are common to 
the tRNA@, tRNAS", and tRNA4*" species 
(fig. S3). Thus, the GatDE interactions with the 
TYC stem do not contribute to tRNA discrim- 
ination but may anchor the T‘YC arm to properly 
position the T¥C-loop*D-loop—assembling re- 
gion toward the tail domain. 

The two helices in the tail domain form a 
concave surface to accommodate the T¥C and D 
loops (Fig. 1). Although the side chains are dis- 
ordered, two helices of the tail domain closely 
contact the T'¥C-loop*D-loop—assembling re- 
gion, especially C56 and G18 (Fig. 2C and fig. 
S4). As a result, the Watson-Crick type tertiary 
base pair between C56 and G18 is significantly 
twisted (Fig. 2C). The concave surface formed 
by the two helices prevents the accommodation 
of larger TV¥C and D loops; this appears to be 
crucial for the discrimination of tRNAG from 
tRNAG". Comparison of the relevant tRNA 
sequences (fig. S3) reveals that tRNAS™ pos- 
sesses a D loop that is three nucleotides larger 
than those of both tRNAS" species. This bigger 
loop may act as an antideterminant in tRNAS™ 
recognition. On the other hand, tRNA“*" has a D 
loop of different nucleotide sequence yet similar 
in size to that of RNAS". 

To experimentally determine the basis for 
tRNA" recognition by M. thermautotrophicus 
GatDE, we constructed tRNA mutants [see the 
supporting online material (SOM)] and (i) 
tested tRNA recognition (RNA binding) by 
gel retardation and (ii) determined the tRNA 
identity (19) by assessing the ability of the 
homologous mutant tRNAs to be enzyme 
substrates in the amidotransferase reaction. 
The two methods may produce some differ- 
ences, because (i) measures RNA binding; 
whereas for (i1), productive binding is required. 
Because the structure revealed binding by shape 
complementarity of the D loop, we prepared 
mutants of tRNAG in which its D loop or 
TC loop was replaced with that of tRNAS™ 
and the D loop/stem of tRNAS"™ was replaced 
with that of tRNA“*"; this resulted in the D- 


loop chimeras tRNAG',,,,, and tRNAS", ... 
and the T-loop chimera tRNAG"", Although 


TGlu’ 


there was slightly reduced binding activity by 
the tRNAS'",,, and tRNAS'",, ,. mutants, 
tRNAG, did not bind (Fig. 2D). In contrast 


DGlu 


to the gel-shift results, the Glu-tRNAS",,,, 


and Glu-tRNAS",, , , species could not be trans- 


amidated by GatDE, whereas Glu-tRNA°",,,,, 
supported the transamidation reaction (Fig. 2D). 
Similarly, the glutamylated U19A and A20U 
tRNA®', mutants did not support amidotrans- 


ferase activity, although these mutant tRNAs 


A 


bind to GatDE (Fig. 2D). Thus, the C-terminal 
helical and tail domains of GatE play a key role 
in discriminating tRNAG™ from tRNAS™ by 
shape complementarity as well as by specific 
nucleotide recognition. 

Another significant identity element differ- 
ence between tRNA@", tRNAS!", and tRNAA™ 
might be the first base pair of the acceptor 
helix; this is AleU72 in tRNAG™ and GleC72 
in tRNAS"™ and tRNA“ (fig. S3). The G1*C72 
tRNAS™ mutant showed 50 times less amido- 
transferase activity, with the GatE-binding 


| 
| Mtransamidase 
@kinase 


| 
| B glutaminase 


Fig. 3. Catalysis by GatDE. (A) The Glu-tRNAS"" kinase and amidotransferase center of the GatE 
subunit. The disordered A76, the attached glutamyl moiety, and substrate ATP are modeled in 
yellow. The catalytically essential Mg2*+ ion is represented by a silver ball. Beside the Mg2* ion, a 
molecular tunnel penetrates into the GatD glutaminase site. C74 and C75 of the extended tRNA 
terminus are shown in cyan. Amino acid residues involved in substrate recognition and catalysis are 
indicated. (B) Molecular tunnel of GatDE. The molecular tunnel connects the GatD glutaminase site 
and the GatE kinase/amidotransferase site. GatE is represented by a violet wire model; two GatD 
molecules are represented by cyan and orange wire models. In the GatD glutaminase site, the 
catalytic Thr?°? and Thr?’7 residues are indicated, as well as a modeled t-aspartate, a reaction 
product, which was reported to bind in the apo form of GatDE from Pyrococcus abyssi (17). In the 
GatE active site, the modeled A76 attached with glutamate and substrate ATP are shown in yellow. C74 
and C75 of the tRNA terminus are indicated in cyan. The observed water molecules, or possibly 
ammonia molecules, are indicated by red balls with a diameter of 0.5 A. (C) The glutaminase, Glu- 
tRNAC"" kinase, and amidotransferase activities (colored blue, orange, and green, respectively) of 17 
GatDE mutants. The saturated activities relative to those of the wild-type GatDE are shown. The activity 
measurements were performed three to five times, and the experimental values were averaged. 
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activity unaffected (Fig. 2D). Therefore, the 
Gl1°C72 base pair in tRNA4*" will be a major 
antideterminant for GatDE recognition. 

The tRNAS® ACCA-terminus enters the 
central cavity of the GatE cradle domain; the 
bases of C74 and C75 stack on each other, and 
the O2 atom of C74 hydrogen-bonds to the 
Arg*!? ¢-amino group (Fig. 3A). In contrast to 
C74 and C75, we did not observe electron 
density corresponding to A76. Possibly glu- 
tamylation is a prerequisite to place A76 in its 
binding site. In the vicinity of C75 there is a 
pocket large enough to accommodate glutamyl 
adenosine. 

In the catalytic pocket, Mg** is observed; it 
is coordinated with the strictly conserved His!>, 
Glu!>’7, and Glu!*4 residues. This pocket is ex- 
pected to house the Glu-tRNA©™ kinase and 
amidotransferase activities, and the coordinated 
Mg?* should play a catalytic role in both reac- 
tions. By molecular dynamics, we then modeled 
glutamyl adenosine (position 76) in the catalytic 
pocket (Fig. 3A), so that the adenine ring was 
sandwiched between Arg*!? and Pro*!> (Fig. 
3A), the N6 atom hydrogen bonded with the 
main-chain carbonyl group of Val*!?, and the 
Oe atom of the glutamyl moiety (covalently 
attached to the 2’-OH of A76) was pointed 
toward the Mg?+ in order to be ready for 
phosphorylation. The o-carbonyl and o-amino 
groups of the glutamyl moiety are recognized 
by the conserved amino acids Lys?37 and 
Glu?>, respectively, through hydrogen-bonding 
interactions (Fig. 3A). The phosphate group of 
A76 forms a hydrogen bond with Arg??! in the 
GatE cradle domain, which simultaneously 
interacts electrostatically with the phosphate 
group of C75 (Fig. 3A). The conserved Arg!77 
also provides an electrostatic interaction with 
the A76 phosphate moiety (Fig. 3B). 

Before the amidotransferase reaction, the 
y-carboxyl group of the glutamyl moiety attached 


Fig. 4. Ternary complex 
formation between GatE, 
GluRS, and tRNA@". Dock- 
ing of T. thermophilus 
GluRS complexed with 
tRNAS" (29) onto the 
present GatE*tRNA“" com- 
plex was accomplished 
by superposing the com- 
plexed tRNA structures. 
In GatDE, the AspRS-like 
insertion domain, cradle 
domain, helical domain, 
and tail domain are col- 
ored violet, cyan, orange, 
and pink, respectively. 
GluRS and tRNA are shown in gray and 
yellow, respectively. 


Helical domain 
(GatE) 


Tail domain 
(GatE) 


to A76 is phosphorylated with concurrent 
hydrolysis of adenosine triphosphate (ATP) to 
adenosine diphosphate (ADP). We therefore 
constructed a docking model of GatDE and 
ATP by superposing the ADP/AIF,-bound 
GatB structure (24) onto that of GatE (Fig. 
3A). This docking model is reliable because the 
ATP-binding pocket exhibits 53.6% amino acid 
identity and root mean square deviation of 
0.665 A (28 Ca atoms) between GatE and 
GatB, with the ATP-interacting Pro!8°, Ser??5, 
Asn??7, and Arg? being strictly conserved. 
In this model, the y-phosphate group of ATP 
coordinates to the essential Mg?* (Fig. 3A). 
Thus, His!>, Glu!5’, Glu!*4, the y-carboxyl group 
of the glutamate attached to the tRNA, the 
y-phosphate of the ATP substrate, and a water 
molecule octahedrally coordinate with the 
Mg?* ion (Fig. 3A). This Mg?* coordination 
may accelerate the polarization of the y- 
phosphorus group of ATP and the y-carboxyl 
group of the tRNA-bound glutamate, promoting 
the nucleophilic reaction that forms y-phosphoryl- 
glutamyl-tRNAG", 

To corroborate the assignment of the three 
catalytic sites by functional enzymatic tests 
(glutaminase, kinase, and transamidase activities), 
mutant GatDE enzymes were generated in which 
critical amino acids were replaced by alanine (see 
SOM). Earlier studies (5) showed the essential 
nature of the GatD residues Thr!®!, Thr!77, 
Asn!78, and Lys?>4 for glutaminase activity of 
the heterodimeric enzyme. However, the mutant 
enzymes retained kinase and amidotransferase 
activity when ammonium chloride was the 
amide donor (/5). Analysis of the GatE structure 
assigned the kinase and transamidase active sites 
to the vicinity of the Mg?* with the strictly 
conserved residues His!5, Glu!5?, and Glu!*+ 
(Fig. 3A). The enzymes resulting from mutation 
of these amino acids lost the ATPase and 
amidotransferase activities but kept the glutamin- 


Cradle domain (GatE) 


AspRS-like 
insertion domain 
(GatE) 
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ase function (Fig. 3C). A similar pattern of ac- 
tivities was found (Fig. 3C) in mutant enzymes 
with alterations in other assigned conserved 
amino acids [Arg??!, Lys?37, and Arg*!? (Fig. 
3A) and Arg!7’, Asp??3, and Glu?3> (Fig. 3B)). 
The reduction of glutaminase activity in the 
mutant Arg??!Ala, Glu?3°Ala, and Lys?37Ala 
enzymes supports the observed coupling of the 
GatD glutaminase activity with Glu-tRNAS” 
binding by GatE (/5). 

The GatDE*tRNAS' complex structure 
makes clear that the kinase/amidotransferase 
catalytic site on GatE is located 40 A apart 
from the glutaminase site on GatD (Fig. 1). How 
does ammonia generated at the latter site move 
to the kinase/amidotransferase site on the other 
subunit? In the vicinity of the catalytic Mg?* 
ion, there is a hole (Fig. 3A) that is the exit of 
an intersubunit 40 A-long molecular tunnel, 
which connects the GatD glutaminase site and 
the GatE kinase/amidotransferase site (Fig. 3B). 
This tunnel penetrates one GatE and two GatD 
subunits, the entrance being located on the 
interface of the GatD dimer (Fig. 3B). With an 
average diameter of 2 A, the tunnel should be 
capable of transporting ammonia. The amino 
acids that line the inside of the tunnel have 
hydrophilic properties; a sequence alignment 
reveals that they are strictly conserved in 
archaea. Therefore, we propose that the ammo- 
nium generated at the glutaminase site in GatD 
is transported along the molecular tunnel to the 
kinase center of GatE, where it amidates y- 
phosphoryl-Glu-tRNAS"™, In the present struc- 
ture, we observed several electron densities, 
corresponding to water or possibly ammonium 
molecules along the tunnel (Fig. 3B), which 
suggests that the mechanism of ammonia 
transport resembles that of potassium transport 
by a Kt channel (25). Mutations of two amino 
acids [Leu!78(GatE)Phe and Leu*3°(GatD)Phe] 
located along the molecular tunnel (Fig. 3B) 
significantly reduced the amidotransferase activ- 
ity (Fig. 3C). The Tyr?’3(GatD)Phe mutation, 
situated at the entrance of the tunnel (Fig. 3B), 
reduced the amidotransferase and glutaminase 
activities considerably (Fig. 3C); this suggests 
that Tyr37> plays a role in withdrawing and at- 
tracting the generated ammonia into the tunnel. 
Which residue(s) deprotonate the transported 
ammonium at the exit of the tunnel and direct it 
to react with the phosphorylated glutamate car- 
boxyl group? Sequence and functional analyses 
indicated that Lys!!°, whose €-amino group is 
activated by the adjacent GatE residues Gln!?!, 
Glu!5’, and Glu!*4, and by GatD Tyr°>, is in- 
volved in the deprotonation of the ammonium 
(Fig. 3, B and C). 

Generally, enzymes using glutamine as an 
amide donor are thought to have a molecular tun- 
nel, through which ammonium is transferred from 
the glutaminase site to the second synthetase site 
(16, 26). A recent structural analysis reported that 
glucosamine-6-phosphate synthase opens its mo- 
lecular tunnel by rotating the indole ring of a Trp 
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residue upon the binding of the glutamine 
substrate to the glutaminase site (27). Elucida- 
tion of a similar tunnel-gating mechanism will 
await the structure determination of GatDE 
complexed with acylated Glu-tRNAS"™. 

The presence of misacylated Glu-tRNAS™ 
as the required precursor for Gln-tRNAS™ 
prompted the question of whether there is a 
complex between the ND-GluRS and the Glu- 
tRNA@" amidotransferase, where the misac- 
ylated tRNA would be transferred from the 
synthetase to the amidotransferase by substrate 
channeling (28). This would ameliorate the task 
of EF-Tu in keeping protein synthesis accurate 
by not transporting Glu-tRNAG to the ribo- 
some (29). Preliminary experiments suggest the 
existence of a complex between M. thermauto- 
trophicus GluRS and GatDE; we then docked 
T. thermophilus GluRS complexed with tRNAS™ 
(30) onto the present GatDE*tRNAS™ com- 
plex by superposing the tRNA structures (Fig. 
4). GluRS fits well into the concave surface 
formed by the GatE cradle and the AspRS-like 
insertion domains, and the bound tRNAG!" 
extensively interacts with the AspRS-like in- 
sertion domain with only minor steric clashes. 
A similar docking of the AspRS*tRNA“*? com- 
plex (3/1) did not succeed because of serious 
steric hindrance, especially with the AspRS-like 
insertion domain. Thus, the AspRS-like inser- 
tion domain may facilitate formation of a 
GluRS*GatDE complex while preventing asso- 
ciation with AspRS. As a result, the ternary 
complex would produce and channel only Glu- 
tRNA®", which is the sole substrate for trans- 
amidation by GatDE (//). The presence of such 
a complex could also account for the lack of 
the AspRS-like insertion domain in GatCAB, 
which converts not only Glu-tRNAS" but also 
Asp-tRNA4*" to the corresponding amide 
aa-tRNAs (23). 

The putative GluRS*GatDE complex, in 
which tRNAS interacts simultaneously with 
the ND-GluRS and the AspRS-like GatE in- 
sertion domain, mimics one of the aaRS class 
I/class II/tRNA complexes that have been pro- 
posed as intermediates in the evolution of the 
aaRS classes (32). This had further experimen- 
tal support in the observed ternary complex of 
the two classes of Methanosarcina barkeri 
lysyl-tRNA synthetase (LysRS) and tRNA?! 
(33) and of LysRS1*LysRS2*tRNA"* from 
Bacillus cereus (34). The presence of such a 
ternary complex in transamidation would give 
credence to the idea that this pathway is a rem- 
nant of early code formation. Furthermore, 
unlike temporary aaRSs, GatE mainly uses 
indirect sequence-independent readout for the 
recognition of tRNAG", a mechanism that also 
operates in EF-Tu (35) and tRNA-modifying 
enzymes (36). Taken together, the data suggest 
that a GatDE-like structure and activity were 
appended to an existing GluRS to provide the 
physical and mechanistic means of adding 
glutamine to the genetic code. 
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Ammonia Channel Couples 
Glutaminase with Transamidase 
Reactions in GatCAB 


Akiyoshi Nakamura,’* Min Yao,’* Sarin Chimnaronk,*? Naoki Sakai,” Isao Tanaka’ + 


The formation of glutaminyl transfer RNA (Gln-tRNA®"") differs among the three domains of life. 
Most bacteria employ an indirect pathway to produce Gln-tRNAS" by a heterotrimeric glutamine 
amidotransferase CAB (GatCAB) that acts on the misacylated Glu-tRNA°"". Here, we describe a series 
of crystal structures of intact GatCAB from Staphylococcus aureus in the apo form and in the 
complexes with glutamine, asparagine, Mn2+, and adenosine triphosphate analog. Two identified 
catalytic centers for the glutaminase and transamidase reactions are markedly distant but connected 
by a hydrophilic ammonia channel 30 Ain length. Further, we show that the first U-A base pair in the 
acceptor stem and the D loop of tRNA°" serve as identity elements essential for discrimination by 
GatCAB and propose a complete model for the overall concerted reactions to synthesize Gln-tRNAS". 


ccurate translation of the genetic code 
into proteins requires high efficiency 
and fidelity of tRNA-aminoacylation 
(1). This vital process is governed by a funda- 
mental family of enzymes called aminoacyl- 


tRNA synthetases (aaRS), which catalyze 
the direct attachment of amino acids to the 
3’-ends of the corresponding tRNAs. However, 
Gln-tRNAS"™ is synthesized by two different 
pathways in nature (2-4). In the eukaryotic cy- 
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toplasm and in some bacteria, glutamine is li- 
gated directly to RNAS" by glutaminyl-tRNA 
synthetase (GInRS), whereas a noncanonical in- 
direct pathway is employed in the majority of 
bacteria and all archaea, which do not possess 
GinRS, through a two-step process (2, 5). Dur- 
ing the first step, RNAS is misacylated with 
glutamic acid by a nondiscriminating glutamyl- 
tRNA synthetase (GluRS), which recognizes 
both tRNAS™ and tRNAS®'". Thereafter, the 
mischarged Glu-tRNA©" is transformed into 
Gln-ttRNAS" by Glu-tRNAG"—dependent ami- 
dotransferase (Glu-AdT). Glu-AdT converts 
Glu-tRNAGS into Gln-tRNAS™ by initially 
activating Glu-tRNAS"™ into y-phosphoryl-Glu- 
tRNAG" at the expense of ATP, which is sub- 
sequently transamidated into Gln-tRNAS"™ using 
ammonia generated by hydrolysis of glutamine 
or asparagine. Moreover, Asn-tRNA“* can also 
be synthesized through such an indirect pathway 
by Asp-tRNA“s"-dependent amidotransferase 
(Asp-AdT) in organisms lacking asparaginyl- 
tRNA synthetase (6, 7). 

Bacterial Glu-AdTs are heterotrimeric pro- 
teins composed of A, B, and C subunits, and 
have therefore been named glutamine amido- 
transferase CAB (GatCAB), while archaea em- 
ploy the heterodimeric GatDE enzymes (8, 9). 
Although bacterial GatCAB acts as both Glu- 
and Asp-AdT, GatDE specifically recognizes 
only Glu-tRNAG (9). It is intriguing that the 
three domains in nature use distinct enzymes to 
synthesize Gln-tRNAS™ in protein synthesis. 
Based on the primary sequences analysis, GatA 
is homologous to the amidase enzymes, whereas 
GatD is closer to the L-asparaginases (9, 10). 
Therefore, GatA and GatD are structurally 
different but are assumed to play the same role 
as the glutaminase subunit for ammonia pro- 
duction. On the other hand, GatB is highly 
related exclusively to its archaeal counterpart, 
GatE (9, 10). GatE alone can produce y- 
phosphoryl-Glu-tRNAS"™, which suggests that 
the activation of Glu-tRNAS"™ is achieved only 
by the GatB or GatE subunits (//). Notably, the 
glutaminase activity and the activation of Glu- 
tRNAG" into y-phosphoryl-GlutRNAS™ are 
tightly coupled upon the binding of Glu-tRNAS" 
(11, 12). Hence, these modules of distinct func- 
tional enzymes were assembled to form a more 
sophisticated protein complex and acquired nov- 
el mechanisms to expand the genetic code dur- 
ing evolution. 

Recently, the a,B, tetramer structure of 
archaeal GatDE from Pyrococcus abyssi was 
reported at 3.0 A resolution (0); however, 
the precise mechanisms for each of the 
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overall reactions coupling the glutaminase 
with the kinase and transamidase activities, 
regulated by Glu-tRNAG™, remain elusive. 
To address this issue, we initiated the crystal 
structure analysis of bacterial GatCAB from 
the pathogenic bacterium, S. aureus Mu50 
strain. The crystal structure of apo GatCAB 
was solved at a resolution of 2.5 A, by the 
single-wavelength anomalous diffraction 
(SAD) method, and refined to a final R-factor 
of 23.8% (R-free = 27.5%) using the LAFIRE 
program running with CNS (/3-/5) (Fig. 
1A). The asymmetric unit contained one 
GatCAB heterotrimeric molecule, in which 
the flexible C-terminal region of GatB 
(residues 412 to 475) was disordered and 
could not be modeled. The structure immedi- 
ately highlights an important role of the C 
subunit as a stabilizer of the protein complex. 
GatC is featured by an unstructured extended 
loop flanked by two helices at its N terminus 
and two £ strands at its C terminus (Fig. 1B). 
GatC wraps around the interface region as a 
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belt and makes extensive interactions with 
both GatA and GatB. The amphipathic Cal 
and Ca2 helices form a helical bundle with 
the hydrophobic core of GatA (Aq9-11, 13), 
thus relieving the local hydrophobicity (Fig. 
1C). The internal loop region of GatC crosses 
over the loop-rich side of GatB, stabilized 
through a hydrogen bond network by two 
invariant Arg®+ and Asp% residues of GatC 
(Fig. 1D). The association of GatAB complex 
is further assured by the hydrophobic inter- 
actions of Ca3 helix of GatC with GatAB. 
Interestingly, the C-terminal CB1-2 strands 
make an antiparallel sheet with a B hairpin 
(BB8-9) of GatB, forming a four-stranded 
sheet, resembling the corresponding region in 
GatDE (Fig. 1E). 

GatA is a single-domain protein consisting 
of a central, mixed 11-stranded B sheet core 
covered by double layers of o helices on the 
top and bottom (fig. $2). The highest struc- 
tural similarity was found in malonamidase 
E2 (MAE2) from Bradyrhizobium japonicum 


Fig. 1. Bacterial GatCAB complex fastens a molecular belt. (A) Front-view ribbon diagram of the overall 
structure of S. aureus GatCAB/glutamine complex at 2.3 A resolution, depicted in three different colors for 
each subunit: blue, green, and magenta for GatA, GatB, and GatC, respectively. Glutamine in the active site 
of GatA is drawn as yellow stick representations, whereas the purple sphere is the magnesium ion found in 
the active site of GatB. ADP from the cocrystal structure with ADP-AIF, is shown together as pink sticks. 
This color code is used throughout all the figures. (B) Top view of annularly shaped GatC. (C) Amphipathic 
helices at the N terminus of GatC form a helical bundle with the hydrophobic core of GatA (hydrophobic 
residues are colored gray). (D) Detailed interactions between the internal loop region of GatC and GatAB 
complex. Conserved residues involved are labeled, with hydrogen bonds indicated (<3.2 A, dashed black 
line). Polar interactions are prominent on the C-terminal side, whereas the hydrophobic interactions are 
clustered on the opposite side. (E) The C terminus of GatC tightens GatAB complex, constructing an 


antiparallel 8 sheet and a hydrophobic platform. 
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[Protein Data Bank (PDB) code 1OCK (/6)], 
with a root mean square deviation of 2.1 A for 
405 pairs of Co, atoms compared. GatA faces 
its loop-rich plane to, and caps a loop-rich side 
of, the cradle domain of GatB, burying 7% 
(2680 A2) of the total surface area in the inter- 
subunit interface (Fig. 1A). There is a putative 
ammonia channel (described below) running 
through the middle of the interface, which is 
surrounded by two layers of interfacial inter- 
actions (fig. S1). The inner layer encircling the 
channel is composed of polar residues, whereas 
the outer layer is mostly hydrophobic; there- 
fore, GatA is moderately attached to GatB 
through both hydrogen-bonding and hydropho- 
bic interactions. 

Seeking the molecular mechanism of the glu- 
taminase reaction in GatA, we soaked GatCAB 
crystals in 1 mM glutamine and determined the 
cocrystal structure by molecular replacement. 
The glutamine was found in the center of GatA 
constructed by the amidase signature sequence 
(fig. S2). The amide group of glutamine is rec- 
ognized by Asp*?5, and the carboxyl group is 
recognized by Arg?>8 (Fig. 2A). The side chain 
of the glutamine is located close to the con- 
served Ser—cis-Ser—Lys catalytic scissors: Ser!’8, 
Ser!™4, and Lys’”®. Consistent with MAE2 (J6), 
in GatA, Ser!>4 is located in the loop enriched 
in small residues (glycine, serine, and alanine) 
and forms an unusual cis conformation, which 
enables itself to deprotonate Ser!78. Surprising- 
ly, Ser!78 made a tight covalent bond (1.44 A) 
with the amide carbonyl carbon atom of the 
glutaminyl side chain, revealing the tetrahedral 
covalent intermediate, which is stabilized by 
the oxyanion hole constructed by the backbone 
nitrogen atoms of Thr!7>, Gly!7°, Gly!7”, and 
Ser!78 (Fig. 2A). We further observed a con- 
tinuous electron density sprouting out from the 
amide group of the side chain of glutamine, 
which may correspond to amide ammonia lib- 
erated from the substrate. On the opposite site is 
a water molecule, which was not observed in 
the apo GatCAB crystal, anchored by hydrogen 
bonds with Ser!>?, Ser!78, and the amide group 
of the substrate itself, which is in the vicinity 
of hydrolysis of the enzyme-acyl intermediate. 
These observations clearly indicated that GatA 
employs the same hydrolysis mechanism as the 
amide hydrolysis activity of amidase. Curiously, 
in contrast to the previous study on GatDE from 
Pyrococcus abyssi that showed glutaminase 
activity only in the presence of Glu-tRNAS” 
(0, 11), GatCAB intrinsically possesses basal 
glutaminase activity without binding of sub- 
strate tRNA [in the current structure and by 
kinetics studies (/2)], which suggests that the 
conformational rearrangement of the glutamin- 
ase active site of GatA is not involved in regu- 
lation of the concerted reactions. 

GatB is organized into two domains con- 
nected by a characteristic ~60 A elongated link- 
er loop (fig. S4). The globular “cradle” domain 
is topologically unique and can be distinguished 


from GatE by two insertions on each side of the 
cradle (0). GatE has an insertion domain— 
resembling those found in bacterial AspRS— 
between Ba3 and BB20, and an antiparallel B 
sheet inserted between BB1 and Bal (fig. S3). 
The auxiliary C-terminal helical domain is built 


by an o-helical bundle architecture with seven 
traceable helices. The disordered C-terminal 
region of GatB (residues 412 to 475) is pre- 
sumed to form another three-helix bundle, as 
indicated by the sequence identity to the C ter- 
minus of uncharacterized Yqey protein (PDB 


Fig. 2. A 30 A long ammonia channel connects the two remote active centers of GatCAB. (A) The active 
site of a glutaminase reaction in GatA is composed of the conserved Ser-cis-Ser-Lys catalytic scissors shown 
as magenta stick representations. Residues involved in the hydrogen-bonded network (dashed black lines) 
in the active site are labeled. A plausible hydrolytic water molecule is colored light blue and is on the 
opposite side of a supposed ammonia product (orange sphere). The Fo-Fc electron density map (contoured 
at 30, green mesh) calculated without the glutamine and Ser?7® clearly demonstrates the tetrahedral 
covalent intermediate of the glutamine with Ser?78. (B) The environment of the ADP binding site shown 
together with the omit Fo-Fc electron density map (2c, blue). Residues contributing to ADP (ball-and-stick) 
recognition are represented as stick models with labels. Two water molecules (light blue spheres) are 
coordinated to a magnesium ion (purple) and to B phosphate. (C) The putative ammonia channel was 
calculated using the program CAVER (26), with the structure of the water-omitted GatCAB/glutamine 
complex. Glu??5® blocking the ammonia transport route is shown in a space-filling representation for 
clarity. The channel was filled with a row of solvent molecules (light blue spheres), which interact with the 
conserved polar residues (colored sticks) along the pathway. A bound glutamine in GatA is drawn as 
spheres indicating the start point of the channel. (D) Schematic representation of the ammonia channel. 
Residues defining the channel are colored corresponding to their properties: red, negative; blue, positive; 
black, nonpolar side chain; gray, main chain. Hydrolyzed ammonia is colored orange. Strictly conserved 
residues are underlined and hydrogen-bonding distances are indicated (A). The presumed movement of 
the Glu125B gate to open the ammonia channel is indicated by a black arrow. 
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code, ING6). An obvious density peak of Mg?+ 
ion was observed at the bottom of the cradle 
domain coordinated to three conserved resi- 
dues consisting of His!?, Glu!?4, and Glu!>°, 
and to three water molecules (Fig. 2B). To 
underline the molecular basis for the mecha- 
nism of the kinase activity, we cocrystallized 
GatCAB with the ADP-AIF, compound as 
an ATP analog. The 3.35 A 2Fo-Fc and Fo-Fc 
maps clearly indicated that ADP sits on strands 
BB1 and BB11 at the bottom of the cradle do- 
main, with a bound water molecule (Fig. 2B). 
The adenosine base dips into a hydrophobic 
pocket, created by Val°, Phe*, and Pro!9>. N1 
and N6 are strictly recognized by an invariant 
Ser!%° residue in agreement with the previous 
report that GTP could not serve as a phos- 
phate donor for GatCAB (/7). The hydroxyl 
groups of ribose interact with the backbone 
of a loop between Ba2 and BB10, whereas 
the ribose O04 is recognized by the conserved 
Asn194 residue. The B phosphate is at an ap- 
propriate distance for hydrogen bonding with 
the conserved Glul0 and an anchored wat95 
molecule. 

Cautious inspection of the Fo-Fc difference 
electron density map from a crystal derivatized 
with MnCl, revealed a pair of Mn cations in the 
active site of GatB (fig. S4). The prominent 
Mn?* binding site is identical to the Mg?* site 
in the native crystal. The second transient bind- 
ing site is located adjacent to the first (a dis- 
tance of 6.3 A), coordinated to the conserved 
acidic residue cluster formed by Glu!®, Asp!°?, 
and Glu?!°. This position corresponds to the 
wat95 water molecule in the ADP-AIF,” struc- 
ture and is at an appropriate distance to make 


Fig. 3. Mutational analyses A 
of tRNA®" and GatCAB. (A) 
Cloverleaf representations of 
tRNAC™ (black) and tRNAS"’ 
(gray) from S. aureus. Indi- 
cated bases or stem loops oc 
(magenta) were altered to 
those of tRNAS” to verify 
the binding activity. Bases 
involved in specific recogni- 
tion by GatCAB are empha- 
sized and marked with blue 
stars. (B) Gel-shift assay with 
tRNA“" variants as specified 
in (A). Bands corresponding 
to 150 pmol tRNA® and 50 
pmol GatCAB are shown in 
the first and second lanes, 
respectively. The molecular 
ratios of the enzyme to 
tRNAC" were fixed at 1:3. 


Uncharacterized bands above tRNA were probably responsible for the tRNA 
dimer suggested by the results of size-exclusion chromatography in our 
experiments. Gels were stained with Coomassie brilliant blue and sequentially 
with toluidine blue to visualize proteins and RNA, respectively. (C) A series of 
GatB deletion mutants (50 pmol GatCAB complex per lane) were subjected to 
gel-shift assay in the absence (-) or presence (+-) of 150 pmol of the wild-type 
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interactions with B- and y-phosphates of ATP. 
These results strongly suggested that GatB (and 
plausibly GatE) uses a two-metal-ion mecha- 
nism of catalysis, which is reminiscent of the 
mechanism employed by glutamine synthetases 
in amino acid biosynthesis (/8—2/). We pro- 
pose that the primary Mg?*-binding site should 
be responsible for coordinate bonding, recogni- 
tion, and positioning of the y-carboxyl group of 
the glutamate attached to the 3’ end of tRNAG" 
(fig. S6). On the other hand, the transient Mg?+ 
binding site is likely to interact with B and y 
phosphates of ATP and to participate in phos- 
phoryl transfer by polarizing the y-phosphate 
group of ATP, promoting the nucleophilic 
attack to form a y-phosphoryl-Glu-tRNAS” 
(fig. S6). 

The apo GatCAB structure revealed a dis- 
tance of 30 A that separates the catalytic sites at 
which the glutaminase reaction takes place and at 
which the CCA end of tRNA binds. We found 
that two reaction centers in GatA and GatB are 
connected by a penetrable tunnel lined with a 
continuous succession of conserved polar residues 
(Fig. 2, C and D). In our GatCAB/glutamine 
complex structure, this tunnel is thoroughly hy- 
drophilic and is filled with a row of solvent 
molecules indicating the actual route of ammonia 
traveling from the amino acid pocket in GatA to 
reach the y-phosphoryl-GlutRNAS" in GatB. 
Strictly conserved positive and negative residues, 
designating the ammonia channel, are located 
alternately along the course (Fig. 2, C and D). 
This may imply a “proton-relay mechanism” by 
which ammonia is dispersed from one site to the 
other by repeating protonation and deprotona- 
tion with these successive conserved polar 
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residues. This hypothesis is reconciled by the 
positioning of Thr!7°4 at the entrance of the 
channel and Lys’? at the exit as the proton do- 
nor and acceptor, respectively. However, addi- 
tional biochemical and structural studies should 
be required to clarify the precise mechanism of 
ammonia transport in GatCAB. In addition, 
distinct ammonia tunnels have been described 
in multistep enzymatic proteins that hydrolyze 
glutamine to obtain ammonia as a substrate for 
a second reaction, such as asparagine synthetase 
B, imidazole glycerol phosphate synthase, or 
glutamine phosphoribosylpyrophosphate, as well 
as in the case of the transmembranal ammonia 
transporter proteins (22-25). All of these adopt a 
dissimilar architecture of hydrophobic tunnel 
(~20 A in length), suggesting that ammonia 
(NH,), but not ammonium ion (NH,*), is trans- 
ported. Therefore, GatB is idiosyncratic not only 
in its folding but also in its mechanism of the 
intramolecular ammonia transportation. 
According to the program CAVER (26) and 
an in-house program used in calculation of the 
tunnel of GatCAB, we readily noticed a bottle- 
neck in the middle of the path of ammonia 
travel (Fig. 2C). The channel is constricted by a 
completely conserved Glu!?> residue residing in 
BB7 of GatB, which makes a salt bridge with 
the invariant Lys** on a loop between BB4 and 
BB3. As the efficiency of glutamine hydrolysis 
is known to be stringently dependent on bind- 
ing of Glu-tRNAS", but neither uncharged 
tRNA nor Gln-tRNA®! (12), the Glu-ttRNAS” 
recognition must induce as yet uncharacterized 
conformational changes. We postulate that there 
is an open-close motion of Glu!?> as a gate sus- 
ceptible to tRNA binding. Juxtaposed to Glu!?° is 
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tRNA“, Only Leu*72 harboring A3 mutant (deletion of three residues from C 
terminus of GatB) retained the same binding activity as wild-type (WT) 
GatCAB. Deletion of Leu*?* from the C terminus of GatB (A4 mutants) 
abolished the tRNA-binding capacity. Further deleted mutant (A10), as well 
as a mutant lacking the helical bundle domain (ACHD) of GatB, did not show 
any tRNA-binding activity. 
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a conserved Asp!”° residue, which forms a tight 
salt bridge with the invariant Arg!°° on the 
opposite BB11 strand, resulting in a twist in part 
of the BB7 strand. This Asp!*°—Arg! salt bridge 
is located precisely at the entrance of the active 
site where the CCA end of tRNA will approach. 
It is not unreasonable to assume that the CCA 
end of the incoming tRNA would stimulate 
the movement of Arg!®, resulting in disrup- 
tion of the salt bridge, relaxing the distortion 
of strand B87, and finally opening the glu- 
tamic gate. This scenario might be a shared 
feature between GatCAB and GatDE, because 
these relevant residues are definitely con- 
served in both molecules. 

Our structural data, however, leave the 
nature and specificity of binding the tRNA 
substrate unsolved. We constructed a series of 
the in vitro transcribed S. aureus (Sav) 
tRNAG™ mutants based on the differences 
between tRNAS™ and tRNAS™ and inves- 
tigated the tRNA-binding activity with the 
gel-shift assay (Fig. 3, A and B). GatCAB 
bound quite specifically to tRNAS™ but not 
tRNAG!", Unexpectedly, unlike GluRS or 
GlInRS, GatCAB did not bind or recognize 
the anticodon of tRNA, whereas swapping the 
upper part of the acceptor stem of tRNAS™ to 
tRNAS™ diminished the binding activity to 
GatCAB. Changing only the first U1-A72 
base pair of tRNAS" almost extinguished the 
binding capacity. This is consistent with the 
fact that the first U1-A72 base pair is 
absolutely conserved and unique to tRNAS™ 
and tRNA“*" in bacteria (27) (fig. S5). No 
binding activity could be detected in the case 
of the D-arm swapped mutant. The differ- 
ences in the D arms between tRNAS!" and 
tRNAS"™ are the size of the D loop and the 
base pair in the middle of the D stem (Fig. 
3A). Further mutational analysis of the D arm 
revealed that the U insertion in the D loop 
was lethal to tRNA recognition. These results 
suggested that the morphology of the D loop 
is important for discrimination between 
tRNAS'™ and tRNAS™ by GatCAB. Taken 
together, GatCAB recognizes and discrim- 
inates two identity elements on tRNAG; the 
U1-A72 base pair is exploited as the positive 
determinant to discriminate tRNAS" from the 
other tRNA species, whereas the U insertion in 
the D loop serves as the negative determinant for 
distinguishing tRNAS™ from tRNAS". 

It is unclear how GatCAB discriminates 
the U1-A72 base pair in tRNAS!" from the 
G-C base pair. Interestingly, in the crystal 
structure of Escherichia coli GInRStRNAS™ 
complex, the first Ul-A72 base pair of 
tRNAS was disrupted by the Leu!*° residue 
at the tip of a B turn in the acceptor-binding 
domain of GlnRS (28). GatCAB may use a 
similar mechanism to distinguish A-U from 
G-C base pairs by the smaller free energy 
cost of base-pair denaturation. If this is the 
case with GatCAB, two turn loops, one 


between BB1l and Ba2 and the other be- 
tween BB12 and Bo3, are good presumable 
candidates. On the other hand, we found that 
the C-terminal helical domain (CHD) of GatB 
has a crucial role in binding and discrimina- 
tion of tRNAS™ (Fig. 3C). The mutant with 
deletion of the entire domain completely 
failed to bind to tRNAG'". Notably, deletion 
of the conserved Leu*”? residue, the fourth 
residue from the C terminus, of GatB also 
caused loss of tRNA-binding activity, where- 
as a GatB deletion mutant harboring Leu*7? 
did not impair binding activity. Because doz- 
ens of hydrophobic residues responsible for 
interhelical interactions are highly conserved 
in CHD, we concluded that elimination of 
Leu*”? disrupted the intact configuration of 
the last three helices (which were not visible 
in the electron density map) of CHD. This 
idea is corroborated by the unusually slow 
migration of Leu*’?-deleted mutant protein in 
our gel-shift experiment. This distal three- 
helix bundle is connected to the Ball helix 
by a long, flexible linker partially seen in 
the present structure and expected to be able 
to reach and bind to the D loop of bound 
tRNA". Discrimination of the size or config- 
uration of the D loop of tRNAS™ may be 
achieved by fitting to the concave made by 
helical bundle as a means of an “indirect 
readout” mechanism (29). 

On the basis of our crystal structures 
combined with the evident biochemical data, 
we can extract a simple rationale for the op- 
eration of GatCAB controlling the Glu- 
tRNASG!"—dependent amidotransferase process 
(fig. S6). Glu-tRNAG" is successfully accom- 
modated by GatB through two checkpoints on 
the body of tRNAS'™; the U1-A72 base pair 
and the D loop. Subsequent precise coordina- 
tion of the carboxylic acid moiety of gluta- 
mate, attached to the terminal adenosine of 
tRNA, to primary Mg?*+ could induce local 
conformational remodeling of the active site, 
shifting the Glu!*> gate to open the ammonia 
channel. This motion may slightly accelerate 
the glutaminase activity of GatA because of 
the occurrent flow of ammonia. However, the 
full activation depends strictly on the recruit- 
ment of ATP from the opposite site of the 
active center. Replacement of the wat95 
molecule with the secondary Mg*+ makes the 
active center ready for the transamidase reac- 
tion by preventing y-phosphoryl-glutamate 
from hydrolysis. Consumption of the ammo- 
nia by the transamidase reaction would ro- 
bustly drive the total potential activity of 
GatCAB by pumping ammonia downward 
with a concentration gradient. Our proposed 
model provides a detailed hypothesis to explain 
the previous biochemical and kinetic studies of 
the coupling reaction cycle of GatCAB (//, 12); 
however, further biochemical and structural 
works on GatCAB/Glu-tRNAG!" complex 
may be needed to complete the scenario of 


the reaction scheme of adding glutamine to 
the genetic code. 
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Rapid Advance of Spring Arrival Dates 
in Long-Distance Migratory Birds 
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Several bird species have advanced the timing of their spring migration in response to recent 
climate change. European short-distance migrants, wintering in temperate areas, have been 
assumed to be more affected by change in the European climate than long-distance migrants 
wintering in the tropics. However, we show that long-distance migrants have advanced their spring 
arrival in Scandinavia more than short-distance migrants. By analyzing a long-term data set from 
southern Italy, we show that long-distance migrants also pass through the Mediterranean region 
earlier. We argue that this may reflect a climate-driven evolutionary change in the timing of spring 


migration. 


any biological processes are affected 
Me climate, and in temperate areas the 

increasing spring temperature over 
the past 20 to 30 years has caused an ad- 
vancement of phenological events in plants and 
invertebrates (/, 2). The earlier onset of spring 
has consequences for the timing of breeding in 
birds, which has evolved to match peak food 
availability (3, 4). We may therefore expect the 
timing of breeding to track any temporal shift 
in food availability caused by a trend in spring 
temperature (5). Most passerine birds breeding 
in temperate areas of the Northern Hemisphere 
are seasonal migrants, and the timing of mi- 
gration ultimately constrains when breeding can 
start (6, 7). Short-distance migrants, spending 
the winter close to the breeding grounds, may 
be able to adjust the timing of migration in re- 
sponse to local climate change, which will be 
correlated to the conditions on the breeding 
grounds. In tropical-wintering long-distance 
migrants, the timing of migration is under en- 
dogenous control (8, 9), and the cues needed to 
trigger the onset of migration are unlikely to be 
linked to the climate on their breeding grounds. 
Therefore, it has been assumed that short- 
distance migrants are more likely than long- 
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distance migrants to vary migration timing in 
response to climate change (0). Here we show 
that such an assumption is not empirically 
justified. 

We estimated trends in arrival time for the 
early, middle, and late phases of migration 
(that is, the species- and site-specific 10th, 
50th, and 90th percentiles of the spring arrival 
distribution) in short- and long-distance pas- 
serine migrants, based on long-term banding 
and observational data (from 1980 to 2004) 
from four bird observatories in Scandinavia 
and a site in southern Italy (//, Fig. 1). We 
also investigated whether year-to-year varia- 
tion in arrival time can be explained by short- 
term climate variability as measured by the 
North Atlantic Oscillation (NAO) (/2). As 
explanatory variables we used the calendar 
year (TIME) and the deviations from linear 
regression of the winter NAO index on year 
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[dNAO; the trend in NAO was weakly 
negative over this time period (//)]. Spring 
migration might advance for two distinct rea- 
sons. First, there can be a microevolutionary 
(genetic) response to the selection pressures 
for earlier breeding. Second, the migrants can 
show a phenotypically plastic response to 
trends in weather or climatic patterns on the 
wintering ground and/or along the migration 
route, whereby if spring arrives early on the 
wintering grounds, spring migration will also 
start early. Thus, a response to TIME may re- 
flect either microevolutionary change or pheno- 
typic plasticity, whereas a response to dNAO 
indicates exclusively phenotypic plasticity in 
the migratory behavior. 

Long-distance migrants have advanced 
their arrival in northern Europe in all phases 
of migration (Fig. 2 and tables S1 to S3). The 
advancement in long-distance migrants is 
strongest in the early phase of migration, 
and there is limited variation between spe- 
cies. Furthermore, the analysis of the data set 
from Italy (from the island of Capri) showed 
that long-distance migrants wintering south 
of the Sahara desert are actually arriving in 
southern Europe progressively earlier. In fact, 
all of the nine species analyzed show a trend 
for earlier spring arrival at Capri in most 
phases of migration (Fig. 2 and table S4). The 
long-term trend on Capri is at least as strong 
as that observed in Scandinavia (Fig. 3). In 
short-distance migrants, instead, we find only 
a weak trend toward earlier arrival, and there 
is considerable variation between species 
(Fig. 2 and tables S1 to S3). 

In accordance with previous findings (/3—/5), 
a high NAO index is associated with the early 
arrival of short-distance migrants in Scandinavia, 
but only in the early phase of migration (Fig. 2). 


Fig. 1. The locations of the 
four bird observatories (F, 
Falsterbo, 55°23’N, 12°49’E; 
O, Ottenby, 56°12’N, 
16°24’E; J, Jomfruland, 
58°53'N, 9°37’E; H, Hanko, 
59°48’N, 22°53’E) and of the 
banding site on Capri (C, 
40°33’N, 14°15’E). 
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Fig. 2. Long-term trend (TIME) and the effect of short-term climatic 
fluctuations (dNAO) on the early [(A), 10th percentile], middle [(B), 50th 
percentile], and late [(C), 90th percentile] phases of the spring arrival 
distribution in short-distance migrants (blue circles) and long-distance 
migrants (red squares) in Scandinavia and on Capri (red triangles). The 
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Species-specific slopes were estimated with a mixed-effect linear model (11). The correlation between 
species, when species- and observatory-specific effects of both TIME and dNAO were accounted for, was 
estimated at 0.48 (95% Cl: 0.17 to 0.75), 0.25 (95% Cl: 0.04 to 0.51), and 0.34 (95% Cl: 0.03 to 
0.68) for the 10th, 50th, and 90th percentiles, respectively. 


On the other hand, most long-distance migrants 
tend to arrive earlier in Scandinavia during 
years of high NAO in all phases of migration 
(Figs. 2 and 3 and tables S1 to S3). The op- 
posite pattern is observed at Capri, where high 
NAO tends to delay arrival times (Figs. 2 and 
3 and data in table S4). The underlying reason 
for this may be found south of the Sahara 
desert, because a high NAO index harms 
productivity over vast areas of northwestern 
and southeastern Africa (16), which may delay 
the spring departure of migrants from sub- 
Saharan wintering areas. 

By showing that long-distance migrants 
have advanced their migration more than 
short-distance migrants, we have challenged 
the conventional wisdom that species 
wintering in temperate Europe should respond 


more strongly to climate change than trans- 
Saharan migrants (/0). Furthermore, the ear- 
lier arrival of trans-Saharan migrants at Capri 
shows that the temporal trend for earlier 
arrival in Scandinavia cannot be explained 
simply by faster migration through Europe in 
response to a concomitant trend of increasing 
temperatures taking place within continental 
Europe (/7). Instead it suggests that (i) the 
onset of migration has advanced, or (ii) the 
speed of migration through Africa has 
increased. Both alternatives could be seen as 
phenotypic responses to trends in the African 
climate patterns having a positive effect on 
the foraging conditions (/8), thereby im- 
proving the birds’ physical conditions, which 
in turn affects their timing of migration (/9) 
and makes the migration (including flight 
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solid symbols are sample averages, and the ellipses delimit their 95% 
confidence regions (11). Estimates for each species are given in tables S1 to 
S4. The differences in effect size for early-phase arrival of short-distance 
migrants versus long-distance migrants was 0.15 [95% confidence interval 
(Cl): 0.06 to 0.23] days per year for the effect of TIME. 


and stopover) more efficient. A positive trend 
in African temperatures (20) has previously 
been suggested as a reason why long-distance 
migrants arrive earlier in northern Europe 
(21). However, increasing African temper- 
atures should decrease productivity (22), 
thereby delaying long-distance migrants’ depar- 
ture from the wintering ground. Hence, the 
earlier arrival is probably not a phenotypic 
response to improved foraging conditions. 
More likely, the rapid advance in arrival dates 
of long-distance migrants in Europe is due to 
climate-driven evolutionary changes in the 
timing of spring migration. Even though 
migratory activity is under endogenous control, 
experiments have demonstrated individual var- 
iation in the response to the photoperiodic cues 
needed to trigger the mechanisms underlying 
the onset of migration (23). The passerine birds 
investigated here reproduce at just 1 year of age 
and thus have the potential for a rapid evo- 
lutionary response to environmental changes. 
Given the considerable heritable genetic varia- 
tion in the timing of migration (24, 25) and the 
selection pressure to breed earlier in Europe 
(6, 7), a change toward earlier arrival is indeed 
to be expected. 
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Intron Removal Requires 
Proofreading of U2AF/3' Splice Site 
Recognition by DEK 


Luis Miguel Mendes Soares,’ Katia Zanier,* Cameron Mackereth,* 


Michael Sattler,* Juan Valcarcel™:3* 


Discrimination between splice sites and similar, nonsplice sequences is essential for correct 
intron removal and messenger RNA formation in eukaryotes. The 65- and 35-kD subunits of the 
splicing factor U2AF, U2AF®> and U2AF?°, recognize, respectively, the pyrimidine-rich tract and the 
conserved terminal AG present at metazoan 3’ splice sites. We report that DEK, a chromatin- and 
RNA-associated protein mutated or overexpressed in certain cancers, enforces 3’ splice site 
discrimination by U2AF. DEK phosphorylated at serines 19 and 32 associates with U2AF?°, 
facilitates the U2AF?°-AG interaction and prevents binding of U2AF®> to pyrimidine tracts not 
followed by AG. DEK and its phosphorylation are required for intron removal, but not for splicing 
complex assembly, which indicates that proofreading of early 3’ splice site recognition influences 


catalytic activation of the spliceosome. 


minimal U2AF heterodimer consisting 
Ac RNA recognition motifs (RRM) 1 


and 2 of U2AF® (J) and the U2AF 
homology motif (UHM or ‘YRRM) of U2AF?° 
(2) was analyzed by nuclear magnetic resonance 
(NMR) spectroscopy in the absence or presence 
of an RNA containing a pyrimidine tract 
followed by a consensus 3’ splice site (3’ss) 
[5 (U),,ACAGG 3’]. As expected from the af- 
finity of U2AF® for uridine-rich sequences (/), 
the presence of the RNA caused extensive changes 
in the NMR spectrum of the U2AF® RRM 1+2 
subunit (Fig. 1A, left). In contrast, small pertur- 
bations concerning few residues were observed 
in the U2AF* ‘YRRM spectrum (right). The 
latter was unexpected, because previous obser- 
vations suggested that U2AF%> specifically 
recognizes the 3’ss-AG (3-5). Gel retardation 
assays using 32P-uridine-labeled RNAs [5’ 
GGG(U), ,AC-AG/CG-GUAAAAUAACUCA 
3’] showed that, although U2AF35 ‘RRM in- 
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creases the affinity of the complex threefold, the 
effect is similar for AG-, CG-, UG- or AA-3’ss, 
strong or weaker pyrimidine tracts (Fig. 1B and 
figs. S1 and S82). Lack of AG discrimination 
was also observed when different assays and 
recombinant full-length U2AF heterodimer or 
U2AF purified from HeLa cells were utilized 
(Figs. 1, C and D). In contrast, both endoge- 
nous U2AF and the minimal heterodimer 
showed preferential ultraviolet (UV) light— 
induced photo-cross-linking of U2AF® to 
AG-3’ss RNAs in nuclear extracts (Fig. 1E). 
Reconstitution of U2AF-depleted extracts with 
recombinant U2AF subunits indicated that 
U2AF*> is required for AG discrimination (Fig. 
1E, bottom). The presence of U2AF*° and other 
components of the nuclear extract decreased 
cross-linking of U2AF® to the nonconsensus 
CG-3’ss, which suggests the existence of a proof- 
reading activity that enforces specific associa- 
tion of U2AF with pyrimidine tracts followed 
by consensus AG-3’ss. 

This activity cofractionated with U2AF during 
the two first chromatographic steps of U2AF 
purification (6) (fig. S3). In fig. 2A, compare lanes 
3 and 4 with 7 and 8 for the U2AF-containing 
complex (identified in lane 2 by supershift 
with antibodies against U2AF®>). The activity 


was, however, separated from U2AF on the next 
chromatographic step [poly(U)-Sepharose]; 
whereas U2AF was retained in the column (6), 
the flow-through fraction provided AG versus 
CG discrimination to the truncated heterodimer 
in both UV-mediated cross-linking (Fig. 2B) 
and gel-retardation assays (fig. S4). The activity 
present in this fraction was retained on an af- 
finity column containing the truncated U2AF 
heterodimer (Fig. 2D, lanes 1 to 4). Comparison 
of the protein profiles of the input and flow- 
through fractions revealed that a 50-kD protein 
was retained in the U2AF column (Fig. 2C, 
lower component of the 50-kD doublet). Mass 
spectrometry analyses identified this protein as 
DEK, a chromatin-, pre-mRNA- and mRNA- 
associated protein overexpressed or mutated 
in certain cancers (7, 8). Consistent with a role 
for DEK in providing AG discrimination to 
U2AF, depletion of DEK from HeLa nuclear 
extracts (fig. S5) resulted in reduced AG 
versus CG discrimination by endogenous 
U2AF® (Fig. 2E, lanes 1 to 4), an effect that 
was reversed when recombinant purified DEK 
was added to the depleted extracts (lane 5). 
Cross-linking between U2AF?> and an RNA 
radioactively labeled at the 3’ss dinucleotide 
(A-[??P]-G or C-[32P]-G) was reduced in DEK- 
depleted extracts, which indicated that DEK is 
required for 3’ss recognition by U2AF?5 (Fig. 
2F). Collectively, the results described above 
indicate that DEK provides a proofreading func- 
tion that allows U2AF to discriminate between 
bona fide AG-containing and nonconsensus 3’ss 
regions. 

DEK retention in U2AF affinity columns 
suggested the possibility of an interaction be- 
tween these factors. Pull-down experiments 
using in vitro translated, 3°S-labeled U2AF® or 
U2AF>> and recombinant purified glutathione 
S-transferase (GST)-DEK revealed formation of 
a complex between DEK and U2AF>, which 
was, at least in part, RNA-independent and 
involved the 100 amino-terminal residues of 
DEK (Fig. 3, A and B). Interestingly, the in- 
teraction was disrupted by phosphatase treat- 
ment (Fig. 3B, lanes 3 versus 4 and 11 versus 
12), which suggests the requirement for protein 
phosphorylation. Indeed, DEK is a phospho- 
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protein (9), and phosphorylation occurs upon 
incubation of DEK(1—100) with rabbit reticulo- 
cyte lysates (fig. S7). Mutation of serines 19 and 
32 to alanine (A) or aspartic acid (D) abol- 
ished phosphorylation (fig. S6). Interaction with 
U2AF*° was abolished by mutation to alanine; 
mutation to the phosphorylation mimic, aspartic 
acid, allowed the interaction to occur even in 
the presence of phosphatase (Fig. 3B, lanes 5 to 
8 and 13 to 16). Taken together, the results of 
Fig. 3 reveal formation of a complex involving 
the amino-terminal 100 amino acids of DEK and 
U2AF*>, which is dependent on phosphorylation 
of serines 19 and 32. The involvement of ad- 
ditional factors in complex formation cannot be 
tuled out, because NMR experiments failed to 
detect significant DEK-induced changes in 
U2AF°> structure or interaction with RNA. 
Recombinant DEK was found insufficient to 
confer discrimination to the purified heterodimer 
(Fig. 2D, lanes 5 and 6). Discrimination was 
restored, however, when DEK was combined 
with the DEK-depleted flow-through fraction 
from the U2AF affinity column mentioned above 
(Fig. 2D, lanes 7 and 8), but not with other 
chromatographic fractions (fig. S3B). The re- 
quirement for adenosine triphosphate (ATP) (see 
below), for U2AF*° (Fig. 2D, lanes 9 and 10) 
and the phosphorylation-dependent association 
of U2AF*° suggested the possibility that DEK 
phosphorylation by a kinase present in the 


Fig. 1. An activity present in A 
nuclear extracts is necessary 

for 3’ss AG discrimination by 8: 
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complementing fraction was necessary for AG 
discrimination by U2AF. Consistent with this 
hypothesis, DEK was phosphorylated in the 
presence of the complementing fraction (fig. 
S6), and mutation of serines 19 and 32 to 
alanine compromised the discriminatory activity 
provided by DEK, whereas mutation to aspartic 
acid maintained the activity in reconstituted 
assays (Fig. 2D, lanes 11 to 14) and in 
complementation of DEK-depleted extracts 
(Fig. 2E, lanes 6 and 7). Part of the discrimina- 
tion provided by the mutant with aspartic acid at 
positions 19 and 32 was lost in the absence of 
ATP (lanes 15 and 16), which suggests that 
phosphorylation of serines 19 and 32 is neces- 
sary, but not sufficient, for full DEK activity. 
Collectively, the results of Figs. 2 and 3 indicate 
that DEK phosphorylation facilitates its associ- 
ation with U2AF°>, and this correlates with the 
activity of the protein to confer AG discrimina- 
tion to the U2AF heterodimer. 

One possible consequence of the absence of 
DEK proofreading would be splicing activation 
of pre-mRNAs containing mutations at the 3’ss 
AG. This was found not to be the case for ei- 
ther IgM pre-mRNA (which requires the 3’ss 
AG to undergo both catalytic steps of splic- 
ing) or AdML (which undergoes—at least to 
some extent—the first catalytic step in the 
absence of 3’ss AG) (Fig. 4A). DEK deple- 
tion, however, inhibited the catalytic steps 
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NMR spectra of 7H,75N amide 
resonance correlations of 
U2AF6 RRM 14-2 (left) and 
U2AF25 RRM (right) form- 
ing a minimal U2AF hetero- 
dimer. Spectra in the absence 


(black) or presence (red) of 
a twofold molar excess of 
5/(U),,ACAGG 3° RNA are 
shown. (B) Gel retardation 
assays using the indicated 
proteins (10~° M for lanes 


1 and 8, and serial three- . 

fold dilutions) and 10 fmol 

of either 5’ 32P-labeled AG 
RNA used in (A) (AG) or a U2AF® + : 


cn 
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AG by CG (CG). Positions of 
unbound RNA and com- 
plexes are indicated. (C) UV- 
induced cross-linking assay 
using GST-U2AF&> (2 x 
1077 M), His-U2AF25 (5 x 
10-7 M) and ?2P-uridine— 
labeled 5’ GGG(U),,AC-AG/ 1 2 
CG-GUAAAAUAACUCA 3’ 

RNAs (10~° M). (D) Gel re- 


tardation assay using purified U2AF heterodimer from Hela cells (10~® M for 
lanes 2 and 7, and serial twofold dilutions). (E) UV-induced cross-linking and 
immunoprecipitation of endogenous U2AF and truncated heterodimer 
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requiring AG recognition in wild-type RNAs: 
first step for the AG-dependent and, mainly, 
second step for the AG-independent pre- 
mRNAs (Fig. 4B). Splicing was restored by 
complementation with recombinant purified 
DEK and the DEK mutant with Asp!° and 
Asp*?, but not with the DEK mutant with 
Ala! and Ala** (Fig. 4, B and C, and fig. $7). 
U2AF was not depleted in DEK-depleted 
extracts (fig. S5), and splicing was not re- 
stored by an excess of U2AF® or U2AF het- 
erodimer (Fig. 4C). Collectively, these data 
establish a correlation—further substantiated 
by the common requirement for phosphorylation 
of Ser!® and Ser??—between three properties 
of DEK: associating with U2AF35, proofread- 
ing U2AF/3’ ss recognition, and sustaining pre- 
mRNA splicing. 

In contrast, neither U2 small nuclear ribonu- 
cleoprotein (snRNP) binding nor subsequent 
events in spliceosome assembly were affected 
by DEK depletion (Fig. 4D), which suggests 
that the absence of proofreading at early stages 
of 3’ss recognition influences catalytic activation 
of splicing complexes. Effects of DEK deple- 
tion on splicing complex formation were, 
however, observed in competition assays in 
which U2 snRNP binding to a 3’ss-containing 
RNA was competed by an excess of RNAs 
containing a pyrimidine tract followed by AG 
or CG. Although only AG-containing RNAs 
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subunits (2.5 x 1075 M) in the presence of HeLa (NE) or U2AF-depleted 
(AU2AF NE) nuclear extracts (33% of reaction volume). Ratios between cross- 
linking intensities for AG- and CG-containing RNAs are indicated (n = 3). 
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were able to compete in mock-depleted extracts, 
both AG- and CG-containing RNAs were 
effective competitors in DEK-depleted extracts 
(Fig. 4E and fig. S8). These results are con- 
sistent with the notion that DEK enforces selec- 
tive functional association of U2AF with bona 
fide 3’ss. 

Alterations in splice-site recognition and in- 
tron removal could underlie the molecular basis 


of pathologies involving DEK overexpression 
or inactivation, including autoimmune disease 
and cancer (7). Consistent with this, overex- 
pression of mutant versions of DEK in cells in 
culture results in reduced splicing efficiency of 
model pre-mRNAs (fig. S9). Previous data 
from iterative selection of sequences from a 
random pool (3) indicated that the U2AF het- 
erodimer is sufficient to select both pyrimidine 
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tracts and consensus 3’ss AG. DEK may en- 
hance this intrinsic preference, revealed to dif- 
ferent extents depending on assay stringency. 
DEK binding could induce a conformational 
change in U2AF*> that enforces AG selectivity 
(Fig. 2F), similar to the enhanced specificity of 
U2A’ for U2 snRNA sequences upon interac- 
tion with U2B” (/0). Elegant studies in yeast 
implicated the transient interaction of the RNA- 
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and recombinant proteins (2.5 x 1076 M). Input: pUS FT, as in 
(B); FT, material unbound to U2AF affinity column; wt, GST- 
DEK; 19,32 A and 19,32 D: DEK mutants with serine replaced 
by alanine (A) or aspartic acid (D) at these positions. ATP was 
present in all assays except where indicated. Ratios between 
cross-linking intensities for AG- and CG-containing RNAs are 
indicated (n = 3). (E) UV-induced cross-linking and immuno- 
precipitation assays as in (B), using mock- or DEK-depleted 
nuclear extracts, complemented with 100 ng/ul of purified 
recombinant DEK or mutant derivatives. (F) Cross-linking and immunoprecipitation assays of U2AF35 carried out as in (E) using RNAs 32P-labeled at ApG or CpG 
3’ss and antibodies against U2AF3°. 
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material. Ribonuclease (RNase) treatment and positions of in vitro translated 
products are also indicated. (B) Pull-down experiments as in (A), using GST 
fusions of DEK (residues 1 to 100) and the indicated mutant derivatives. CIP 
indicates treatment with calf intestine alkaline phosphatase. 


Fig. 3. Phosphorylation-dependent association between DEK and U2AF?°. 
(A) Pull-down of 7°S-labeled in vitro translated U2AF subunits by the 
indicated GST fusion proteins (PUF-60 and TIA-1 are splicing factors used as 
negative controls). P, precipitation products; SN, 1/10 of nonprecipitated 
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Fig. 4. DEK is required for pre-mRNA splicing but not for spliceosome assembly. (A) In vitro splicing assays using mouse 
immunoglobulin IgM or adenovirus major late (AdML) promoter pre-mRNAs in which the 3’ss AG was mutated to GA, in mock- 
or DEK-depleted nuclear extracts. (B) Splicing assays as in (A) using wild-type pre-mRNAs, in the absence or presence of 
recombinant purified DEK (30 ng/ul). The positions of pre-mRNA, splicing intermediates, and products are indicated. AdML 
(bottom) shows a clearer effect on lariat intermediates and products. (C) Splicing complementation assays as in (B), 


recombinant DEK; DEK Ala (A) and Asp (D) mutants at serines 19 and 32; recombinant U2AF®> (5, 10, and 15 ng/l); and purified HeLa U2AF 
heterodimer (10 ng/ul). Quantification of splicing efficiency from three independent experiments is shown. (D) Spliceosome assembly corresponding to 
splicing assays as in (B). IgM 3’, RNA containing 3’ half intron and exon 2 of IgM pre-mRNA. Positions of heterogeneous nuclear RNP (H) and 
spliceosomal (A and B/C) complexes are indicated. (E) Spliceosome assembly assays as in (D) using IgM 3’ RNA in the presence of 50, 200, and 400 
molar excess of unlabeled RNAs containing AG or CG at the 3’ss as in Fig. 1B. 


dependent adenosine triphosphatase (ATPase) 
Prp16 with the spliceosome as a timing device 
that enforces kinetic proofreading of lariat 
intermediates before the second catalytic step 
(11). An RNA-dependent ATPase could facil- 
itate DEK-mediated displacement of U2AF 
from pyrimidine tracts not followed by AG 
3’ss, as reported for other RNA-protein inter- 
actions (/2). Kinetic proofreading could explain 
why catalytic activation of the spliceosome 
requires discrimination between bona fide and 
cryptic 3’ss during early splicing complex 
formation. Phosphorylation affects DEK ac- 
tivity in 3’ss recognition and splicing, which 
suggests that casein kinase II (/3) or other 


1964 


kinases can influence the selectivity of 3’ss 
identification and possibly alternative splicing. 
Phosphorylation could also cause a switch in 
DEK partners, including those that mediate its 
association with chromatin or the transcrip- 
tional machinery (7); this may underlie aspects 
of the coupling between transcription and pre- 
mRNA processing (/4). 
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The Ant Odometer: Stepping on 


Stilts and Stumps 


Matthias Wittlinger,’* Riidiger Wehner,” Harald Wolf* 


Desert ants, Cataglyphis, navigate in their vast desert habitat by path integration. They 
continuously integrate directions steered (as determined by their celestial compass) and distances 
traveled, gauged by as-yet-unknown mechanisms. Here we test the hypothesis that navigating 
ants measure distances traveled by using some kind of step integrator, or “step counter.” We 
manipulated the lengths of the legs and, hence, the stride lengths, in freely walking ants. Animals 
with elongated (“stilts”) or shortened legs (“stumps”) take larger or shorter strides, respectively, 
and concomitantly misgauge travel distance. Travel distance is overestimated by experimental 
animals walking on stilts and underestimated by animals walking on stumps. 


oraging Saharan desert ants, Cataglyphis 

fortis, use a mode of dead reckoning 

known as path integration (/, 2) to moni- 
tor their current position relative to the nest and 
to find their way home (3). This enables them 
to return on a direct route, rather than retracing 
the tortuous outbound journey performed when 
searching for food items in their flat desert 
habitat, which is often completely devoid of 
landmarks. The path integrator requires two 
kinds of input information: about directions 
steered, as obtained via the ant’s celestial com- 
pass (4), and about distance traveled, as gauged 
by the ant’s odometer. 

The cues by which ants measure travel 
distance during locomotion have not yet been 
discovered. However, there are several prom- 
ising hypotheses. The “energy hypothesis” 
posits that the (surplus) energy required for 
locomotion (as opposed to rest) is used to 
calculate travel distance. This hypothesis is of 
long standing in arthropod research (5), but is not 
applicable to the problem of the ant odometer, 
because ants assess their walking distances with 
great accuracy, irrespective of the load they 
carry (6). The “optic flow hypothesis” has been 
proven in honeybees, which integrate visual 
flow-field cues during their foraging flights to 
gauge flight distance (7, 8). In Cataglyphis ants, 
this mechanism plays a minor role, if it has any 
effect (9) (for further discussion of the optic flow 
hypothesis, see SOM). Even in complete dark- 
ness, on featureless platforms, or with the ventral 
halves of their eyes covered, the animals are still 
able to assess traveling distance correctly during 
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their homing runs (9, /0). Also, lateral optic 
flow does not have any influence at all on dis- 
tance estimation (//). Considering the relatively 
constant locomotor speed of desert ants, a time- 
lapse integrator might function to measure walk- 
ing distance—although this possibility has been 
refuted in slightly different experimental con- 
texts (/2). Thus, ants appear to rely primarily on 
idiothetic cues, most probably derived from the 
movements of their legs (13). Although this step 
integrator or “pedometer” hypothesis was ini- 
tially proposed as early as 1904 (/4), it has re- 
mained untested. Here, we examine whether or 
not ants with manipulated leg lengths, walking 
on stilts or on stumps, exhibit changes in their 
stride lengths and, consequently, misgauge their 
travel distance during homebound runs. 

The ants were trained to walk from their nest 
entrance to a feeder, over a distance of 10 m and 
in a linear alloy channel (Fig. 1, top). After at 
least 1 day of training, the animals were caught 
at the feeding site and transferred to a test 
channel, aligned parallel to the training channel 
(Fig. 1, bottom). Once transferred into this test 
channel, the ants performed their homebound 


Feeder Nest 
v v 
Training 


Point of release 


Homebound run 
—_—_—_ 


Turn 2 
trajectory +. 4 > Turn 3 


Fig. 1. Schematic diagram of channel layout, as 
well as training and testing procedures applied in 
Cataglyphis foragers (not drawn to scale). 


Search 


runs, and we recorded the point at which the 
ants switched from their straight and steady 
return path to their characteristic nest-searching 
behavior. This point is marked by a 180° U-turn 
(15), followed by a run pacing back and forth 
around the anticipated location of the nest 
entrance. Details of the experimental procedure 
are given in (/6). 

Ants that had reached the feeder on a 
foraging trip through the training channel were 
caught and subjected to experimental manipu- 
lation. To increase stride length on the animals’ 
homebound runs, their legs were splinted and 
extended with pig bristles glued to the tibia and 
tarsus, as illustrated in Fig. 2 (stilts). To de- 
crease stride length, the legs were shortened by 
severing the tarsomeres in the middle of the 
tibia segment (Fig. 2, stumps). Operated ani- 
mals were supplied with a food item and trans- 
ferred to the test channel, with the food item in 
their mandibles (“test 1”). These ants started 
determined homeward runs, and when they had 
covered the assumed distance to the nest, they 
switched to the nest-searching behavior out- 
lined above. The ants walking back homeward 
on stilts clearly overshot [15.30 m, interquartile 
range (IQR) = 3.24 m, n = 25], whereas ants 
with shortened legs undershot (5.75 m, IQR = 
1.81 m, n = 25) with regard to their normal 
homing distance (10.20 m, IQR = 2.40 m, n = 
25) (Fig. 3A). There are statistically significant 


Normal 


Stilts 


Fig. 2. Manipulation of ant legs as performed in 
the present study. In stilts, attached pig bristles 
elongated the legs; second, normal unmodified legs, 
with approximate range of tarsus movement in- 
dicated; third, shortened legs (stumps). The right 
hind leg is shown from anterior [see figure 1 in (28)]. 
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differences (P < 0.001, Kruskal-Wallis test) 
among these groups. 

The modified and tested ants were put back 
into their nest and were tested again when they 
turned up at the feeder during one of the 
following days (“test 2”). Having already per- 
formed their outbound runs to the feeder on stilts 
or stumps, these animals exhibited homing runs 
almost identical to those of normal, unmodified 
ants (Fig. 3B). Ants walking on stilts (stilts, 
n = 25) searched for the nest at 10.55 m (IQR = 
1.45 m) distance from the release point and 
ants with shortened legs at 10.25 m (IQR = 
1.76 m) (stumps, n = 25). No significant dif- 
ferences were observed between these groups 
or with regard to unmodified animals (above, 
10.20 m). 

These results support the hypothesis that 
desert ants use a pedometer for distance mea- 
surement, or a step integrator [loosely speaking, 
a step counter, although the ants most probably 
do not literally count (/7)]. According to the 
pedometer hypothesis, ants that have traveled to 
the feeder on normal legs and have had their 
leg length modified at the feeder should cover a 
different distance on their homebound journey. 
This is because the same number of strides made 
during the outbound travel, as registered by the 
step integrator, should presumably carry them 
over longer (stilts) or shorter (stumps) distances, 
respectively. And later, on reemerging from the 
nest with their legs still modified, the number of 
strides will be the same again for outbound and 
inbound travel (although different from the 
situation with normal legs in both cases). No 
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such straightforward interpretation is possible 
for any of the competing hypotheses. 

This interpretation rests on the assumption, 
however, that stride length is indeed altered 
by the manipulation of leg length, roughly in 
proportion to the imposed length changes. Thus, 
a quantitative assessment of the (manipulated) 
ants’ stride lengths and their relation to walking 
distances is also necessary. It was impossible to 
count the ants’ strides during experiments (for 
instance, by filming the complete outbound and 
homebound journeys on high-speed video). 
Instead, stride lengths were determined in a 
different set of animals, both normal and 
manipulated, in a small pen erected on the desert 
floor. As in most animals, stride length in nor- 
mal Cataglyphis depends on at least two pa- 
rameters. The first is leg length and, thus, body 
size, as body morphology is isomorphic in 
Cataglyphis fortis workers (18). That is, larger 
ants take correspondingly longer strides. The 
second is walking speed. The faster an ant runs, 
the larger are the strides it takes (19-21). Nor- 
malization of homebound travel distances with 
regard to both body size and walking speed is 
thus required for a quantitative interpretation of 
the above experiments. High-speed video record- 
ings of running ants were analyzed to determine 
the actual changes in stride lengths caused by the 
stilts and stumps manipulations described above. 
Walking speeds were determined in the above 
experimental animals by the time required for 
uninterrupted straight segments of homebound 
travel (and confirmed on the high-speed movies 
in a different set of animals). 
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Fig. 3. Homing distances of experimental ants, tested immediately after the lengths of their legs had 
been modified at the feeding site. (A) Leg lengths were normal during the outbound journey but 
manipulated during the homebound run, resulting in different homing distances. (B) Ants tested after 
reemerging from the nest after previous manipulation. In this situation, leg lengths were equal, 
although manipulated, during outbound and homebound runs. Box plots show median values of the 
homing distances recorded in n = 25 ants per experiment (as well as IQRs, box margins, and 5th and 
95th percentiles, whiskers). Median values of the initial six turning points of an ant’s nest-search 
behavior were considered as the centers of search, indicating homing distance. The hatched box plots in 
(A) illustrate the centers of search as predicted from the high-speed video analyses of stride lengths in 
normal and manipulated animals. The open box represents the prediction corrected for slow walking 


speed. Details in text. 


To derive a quantitative prediction of how 
far the manipulated ants would have to run on 
their homebound journey, based on the pedom- 
eter hypothesis, the experimental data were first 
normalized with regard to body size. That is, 
relative stride lengths with regard to body di- 
mensions were calculated to eliminate effects of 
size variation. Second, the actual stride lengths 
were determined in normal and manipulated 
ants. Ants walking on stilts took significantly 
longer strides (P < 0.01, ¢ test; 14.8 mm, SD = 
+ 2.5 mm, or +13.9%) than did normal ani- 
mals (13.0 mm, SD = + 1.98 mm), and ants on 
stumps made significantly shorter strides (P < 
0.001; 8.6 mm, SD = + 1.73 mm, or —33.2%). 
These values were then used to derive a predic- 
tion for the respective undershoot or overshoot 
of manipulated ants, based on the assumption 
that distance is measured by a step integrator, that 
is, that the ants took the same number of strides 
during their outbound journey to the feeder and 
when returning to the nest on manipulated legs 
(hatched boxes in Fig. 3A). The predicted values 
show that there is general agreement between 
the predicted and observed homebound travel 
distances in manipulated ants, further corrobo- 
rating the step integrator hypothesis. 

However, there are some differences (sig- 
nificant in stilts, P = 0.015; not significant in 
stumps, P = 0.125; Mann-Whitney rank sum 
test) between the observed and the predicted 
homebound travel distances. Impairment of the 
ants by the preparation procedures cannot ac- 
count for these effects. Any serious impairment 
would tend to shorten the ants’ homebound 
runs. The animals walking on stilts clearly trav- 
eled for much longer distances than the normal 
controls, however, and they did so with appar- 
ently normal vigor. Indeed, experimentally mod- 
ified ants were observed to stilt or stump through 
their habitat on successful foraging trips several 
times a day and for many days thereafter (see 
movie S1 in SOM). 

The differences between observed and pre- 
dicted homing distances may be attributable, 
though, to altered walking speeds in the ma- 
nipulated ants. In fact, normal ants traveled at 
an average 0.31 m/s, and ants walking on stumps 
averaged 0.14 m/s, a value in good agreement 
with their shortened legs and stride lengths (see 
similarity of predicted and experimentally de- 
termined homing distances in Fig. 3A). Ants on 
stilts however, rather than walking at increased 
speeds, were also slightly slower than normal 
animals, walking at an average 0.29 m/s. This 
was presumably due to the added load of glue 
and pig bristles on their legs. As noted above, 
changed walking speeds are associated with 
correspondingly altered stride lengths, which 
would appear to explain the unexpectedly short 
prediction value in Fig. 3A (top hatched box). 
We thus corrected this prediction with the es- 
tablished relations between walking speed and 
stride length (/9, 20). We assumed as a first 
approximation (and conservatively, when con- 


30 JUNE 2006 VOL 312 SCIENCE www.sciencemag.org 


sidering the actually imposed changes in leg 
length) that ants might run by as much faster on 
stilts as they ran slower on stumps (0.48 m/s, a 
value regularly observed in highly motivated 
normal ants and almost reached by the fastest 
ants on stilts). This procedure indeed yields a 
value that is not significantly different from the 
observed homing distances in ants on stilts 
(open box in Fig. 3, A; 14.25 m, IQR = 3.35 m), 
thus confirming the consistency of our data with 
the step integrator hypothesis. 

The slower speeds of the ants walking on stilts 
further rule out the only alternative explanation of 
our homing distance data (Fig. 3A, solid boxes). 
In principle, a step integrator and a time-lapse 
integrator would both yield the same homing 
distances, even in ants with manipulated leg and 
stride lengths, if only the ants kept their stride 
frequencies constant [or in normal ants, walking 
speed—which in fact they almost do under nor- 
mal conditions (/9, 20)]. Constant stride frequen- 
cy would result in a change in walking speed in 
proportion to altered stride length and a resulting 
difference in homing distance during a set (out- 
bound) travel time. This assumption is evidently 


not correct, though, given the walking speeds of 
the experimental animals. 

Future studies will have to address the mech- 
anism of the proposed step integrator, for 
example, whether it actually registers steps by 
means of proprioreceptors, or whether it inte- 
grates activity of a walking pattern generator, 
and to what extent sensory feedback regarding 
stride length and walking performance is 
considered. 
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Social Modulation of Pain as Evidence 
for Empathy in Mice 


Dale J. Langford, Sara E. Crager, Zarrar Shehzad, Shad B. Smith, Susana G. Sotocinal, 
Jeremy S. Levenstadt, Mona Lisa Chanda, Daniel J. Levitin, Jeffrey S$. Mogil* 


Empathy is thought to be unique to higher primates, possibly to humans alone. We report the 
modulation of pain sensitivity in mice produced solely by exposure to their cagemates, but not to 
strangers, in pain. Mice tested in dyads and given an identical noxious stimulus displayed increased 
pain behaviors with statistically greater co-occurrence, effects dependent on visual observation. 
When familiar mice were given noxious stimuli of different intensities, their pain behavior was 
influenced by their neighbor's status bidirectionally. Finally, observation of a cagemate in pain 
altered pain sensitivity of an entirely different modality, suggesting that nociceptive mechanisms in 


general are sensitized. 


Ithough most consider true empathy 
Ae be an exclusive ability of higher 

primates, empathy may be a phyloge- 
netically continuous phenomenon with sub- 
classes such as “emotional contagion” well 
within the reach of all mammals (/). However, 
there is little evidence for adult-adult empathy 
outside of primates. In rats (2) and pigeons (3), 
the pain-related distress of a conspecific can 
serve as a conditioning stimulus. Rats produced 
operant responses to terminate the distress of a 
conspecific (4), but this might be better ex- 
plained by arousal than altruism (5). One theory 
of human empathy postulates “physiological 
linkage” between empathizing individuals (6). 
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In one study, empathic accuracy for negative 
emotion was highest in those dyads featuring 
high levels of time synchrony of autonomic 
measures (7). We hypothesized that if empathy 
does indeed exist in mice, the real-time ob- 
servation of pain in one mouse might affect the 
responses of its conspecifics to painful stimuli. 

We first used a sensitive nociceptive assay, 
the reflexive 0.9% acetic acid abdominal con- 
striction (“writhing”) test. We placed mice 
singly within transparent Plexiglas cylinders to 
observe writhing behavior. For comparison, we 
placed two same-sex mice within each cylinder 
and injected either one or both mice. In the 
“both writhing” (BW) condition, each mouse 
observed the other in pain; in the “one writh- 
ing” (OW) condition, the injected mouse ob- 
served an uninjected counterpart. BW mice 
displayed significantly more pain behavior than 
isolated mice, but only when their counterparts 


were cagemates (Fig. 1A). The hyperalgesia was 
marginally enhanced in same-sex siblings liv- 
ing together, but a separate experiment con- 
firmed that close genetic relatedness was not 
required (fig. S1). Writhing behavior in BW 
dyads co-occurred in time at levels significantly 
exceeding those expected by chance (Fig. 1B) 
and significantly more so in cagemate pairs 
than stranger pairs. The hyperalgesia and be- 
havior co-occurrence developed over 14 to 21 
days of being housed together (Fig. 1, C and 
D). In general, observed behaviors other than 
writhing were similar across all conditions 
(figs. S2 and S3), although evidence suggested 
higher levels of anxiety or stress produced by 
the noxious stimulus in stranger pairs relative to 
cagemates (fig. S4). Because the observed ef- 
fects on pain behavior were higher in cage- 
mates, stress is not a likely mediator. 

When strangers were tested in dyads, a sig- 
nificant decrease in writhing behavior was ob- 
served in the OW condition compared to that 
observed in isolation (Fig. 1A). The inhibition 
was entirely specific to males (fig. S5) and is 
likely due to distraction or social stress-induced 
analgesia. 

These findings imply the communication of 
pain from one mouse to another. To determine 
the transmitting sensory modality, we blocked 
sensory inputs individually, by placing physical 
barriers to sight and/or touch or by rendering 
mice anosmic or deaf (8). The only manipula- 
tion that significantly abolished the BW/OW 
hyperalgesia was a visual blockade using an 
opaque Plexiglas barrier (Fig. 2A). [Despite their 
albinism, the CD-1 mice used in these studies 
display no deficits in visually dependent be- 
havioral tasks (9).] The opaque barrier also 
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Fig. 1. (A to D) Mice injected with 0.9% acetic acid in the presence 
of similarly injected cagemates display higher levels of pain behavior, 
which co-occurs in time. In all graphs, group sample sizes are 
indicated in italics. (A) Mice were tested in isolation (Isolated), or in 
dyads where either one mouse (One Writhing; OW) or both mice 
(Both Writhing; BW) received acetic acid injections. Bars represent 
the mean + SEM percentage of sampled intervals showing writhing 
behavior (% Samples Writhing). *P < 0.05, ***P < 0.005 by Dunnett 
two-way case-control comparison posthoc test compared to Isolated 
mice. (B) Statistically significant co-occurrence in writhing behavior 
in the Cagemates and Strangers conditions (sign test, P < 0.05 in 
both cases); the co-occurrence was significantly higher in Cagemates. 
Using data from (A), the expected number of samples with writhing in 
both mice of the dyad was calculated as a joint probability. Bars 
represent the mean + SEM excess of observed samples with joint 
writhing above the expected value, as a percentage. **P < 0.01 
compared to Strangers (Student's t test). (C) Data from a separate 
experiment using naive mice housed together for 1, 7, 14, 21, or 28 
days and tested in BW dyads. Isolated mice were taken from the 28- 
day group, but were tested alone. Bars are as in (A). *P < 0.05 by 
Dunnett one-way case-control comparison posthoc test compared to 
Isolated mice. Data in (D) were calculated from subjects shown in 
(Q); symbols represent the mean + SEM excess of observed samples 
with joint writhing above the expected value, as a percentage. *P < 
0.05 compared to zero (sign test). Significant linear trends were 
evinced in (C) and (D) (P = 0.001 and P < 0.005, respectively). 


Fig. 2. (A and B) Apparent dependence of socially mediated 
hyperalgesia and co-occurrence on visual cues. Mice, all 
cagemates (n = 10 to 36 per group; housed together for >21 
days), were tested in dyads as described in Fig. 1, such that 
either one mouse (One Writhing; OW) or both mice (Both 
Writhing; BW) received 0.9% acetic acid injections. “Control” 
data (intact mouse face cartoon) were taken from Cagemates 
condition in Fig. 1 for purposes of comparison. (A) Bars 
represent the mean + SEM percentage of sampled intervals 
showing writhing behavior (% Samples Writhing). *P < 0.05 by 
Student's t test compared to OW group. The significantly lower 
writhing behavior of the Deaf-OW group reflects the relative 
insensitivity to the noxious stimulus of the BALB/c strain, as 
previously reported (24). (B) The abolition of writhing behavior 
co-occurrence in BW dyads in which one mouse is prevented 
from seeing the other (Opaque condition). Bars represent the 
mean + SEM excess of observed samples with joint writhing 
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blocked the co-occurrence of writhing behavior 
in the BW condition (Fig. 2B). Zinc sulfate 
treatment destroys the olfactory epithelium in 
the mouse but spares axonal transport from the 
vomeronasal organ to the accessory olfactory 
bulb (/0), and thus pheromonal communication 
cannot be ruled out. It is, of course, highly 
likely that the recognition of the other mouse in 
the dyad as stranger, familiar, or sibling was 
achieved via olfactory cues (//), which were 
likely unimpeded by the barriers. Indeed, social 
communication is recognized to be commonly 
multimodal in many species (/2). 

An existing data set (/3) provided an inde- 
pendent verification of the social co-occurrence 
of pain behavior in simultaneously tested mice, 
in another assay. In the 5% formalin test, lick- 
ing behavior was statistically time-synchronized 
within runs of four mice tested individually in 
Plexiglas observation cylinders, but in close 


proximity and in full view of each other (figs. 
S6 and S7A). The co-occurrence of pain be- 
haviors in familiar individuals may itself be 
evidence of empathy, representing a compelling 
analog to the demonstrations of physiological 
linkage in empathizing humans (7). 

These formalin data also showed a reduction 
of between-subject variance within a run (fig. 
S7B), suggesting that subjects’ pain behaviors 
were being influenced, perhaps bidirectionally, 
by their neighbors. In a new experiment, we 
compared pain behavior in “both licking” dyads 
in which both mice received either a high 
dose (5%) or a low dose of formalin (1%), or in 
which each mouse received different doses (1%, 
5%). Pain behavior was influenced by that of 
the neighbor mouse, such that licking times 
were marginally increased in mice receiving the 
low dose while observing a high dose—injected 
cagemate, and significantly reduced in mice 


receiving the high dose while observing a low 
dose-injected cagemate (Fig. 3). No significant 
effects were observed among strangers (fig. S9). 

Finally, we investigated whether the obser- 
vation of a cagemate in pain could modulate 
sensitivity to pain of a wholly different modal- 
ity. Mice were tested in dyads as described, but 
in addition to measuring writhing behavior, we 
tested all mice for their sensitivity to withdraw 
from a noxious radiant heat stimulus before and 
at 5-min intervals after injection of acetic acid 
(or no injection). Injection and the mere ob- 
servation of a cagemate’s writhing behavior 
both produced significant and equivalent ther- 
mal hyperalgesia (Fig. 4). No observation ef- 
fects whatsoever were observed among strangers 
(fig. S10). Concurrent thermal pain testing did 
not abolish the BW/OW increase in writhing 
behavior (Fig. 4C), and a significant correlation 
was observed between the writhing behavior of 
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Fig. 3. (A and B) Bidirectional modulation of pain behavior produced by 
observation of a cagemate in the formalin test. Mice, all nonsibling 
cagemates (n = 22 to 24 per group; housed together for >21 days), were 
tested in dyads. In the “Same” condition, both mice received either 1% 
formalin or 5% formalin. In the “Different” condition one mouse received 
1% formalin and the other received 5% formalin. All groups displayed the 
expected biphasic pattern of responding (A and B). A two-way (injected 
dose x observed dose) repeated measures analysis of variance (ANOVA) 
revealed a significant three-way interaction (P < 0.05). (A) Data from all mice 
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the dyad. (B) Data from all mice receiving 5% formalin; the legend describes 
the status of the other mouse in the dyad. In (A) and (B) (note different 
ordinate scales), symbols represent the mean + SEM percentage of sampled 
intervals showing formalin-induced recuperative behavior (% Samples 
Licking) per 5-min time bin. (C) Totals in all conditions from 0 to 40 min 
after injection, after which there was no longer significantly different licking 
behavior between 1% and 5% groups. ANOVA revealed a highly significant 
injected dose x observed dose interaction (F, ,, = 9.3, P < 0.005). *P < 0.05 
compared to analogous 1% condition. tP < 0.05 compared to analogous 


receiving 1% formalin; the legend describes the status of the other mouse in “Same” condition. 
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Fig. 4. (A to D) Thermal hyperalgesia produced by injection of acetic acid, by mere observation 
of a cagemate injected with acetic acid, or both. Mice (all nonsibling cagemates; n = 28 to 31 
per group; housed together for >21 days) were tested in dyads as described in Fig. 1. Before 
injection, all mice were tested for baseline thermal sensitivity. In the BW (“both writhing”) group, 
both mice were removed at time = 0, given an injection of 0.9% acetic acid, and returned to their 
cylinder. In the NW (“none writhing”) group, both mice were removed and replaced, with neither 
receiving any injection. In the OW (“one writhing”) group, one mouse received an acetic acid 
injection (OW-Inj.) and the other (OW-Uninj.) did not. All mice were retested for thermal 
sensitivity at 5-min intervals for 30 min. Symbols in (A) represent the mean + SEM paw- 
withdrawal latencies (average of both hindpaws). Bars in (B) represent the mean + SEM average 
change in paw-withdrawal latencies from the baseline latency. *P < 0.05 compared to NW group 
and zero; tP < 0.05 compared to the group immediately to the left. Bars in (C) represent the 
mean + SEM percentage of sampled intervals showing writhing behavior (% Samples Writhing) of 
mice receiving acetic acid (both mice in BW group; OW-Inj. mice). *P < 0.05 compared to OW-Inj. 
group. (D) A significant correlation (r = —0.40; P < 0.05) between the writhing behavior of one 
mouse in a dyad (ordinate; BW and OW-Inj. only) and the average (postinjection) paw-withdrawal 
latency of its dyadic counterpart (abscissa; BW and OW-Uninj. only). 


one mouse in the dyad and the thermal hy- 
peralgesia exhibited by the other (Fig. 4D). These 
data suggest that the pain system is sensitized in 
a general manner by the observation of pain in a 
familiar, and furthermore demonstrate that so- 
cially mediated hyperalgesia can be elicited in 
the clear absence of imitation. Mechanisms un- 
derlying these phenomena are thus more likely 
to be found in the sensory/perceptual system than 
in the motor system. 

Rodents are known to recognize and have 
emotional reactions to the pain of conspecifics 
(2), and their pain sensitivity can be altered by 
social factors (14-17). However, most of these 
studies reported analgesia rather than hyper- 
algesia and did not evaluate effects in real time, 
when another’s pain was actually being ob- 
served. These phenomena may represent an 
example of coaction social facilitation, de- 
pending on one’s definition of that term (/8). 
However, our findings are consistent with the 
perception-action model of empathy proposed 
by Preston and de Waal (/), both in the auto- 
matic priming of somatic responses in a state 
similar to that of the attended object and in the 
modulating effects of familiarity and similarity 
of experience between subject and object. Our 
observations cannot be easily explained by stress, 
imitation, or conditioning, and they neither de- 
pend on nor necessarily indicate the presence of 
sympathy, conscious (cognitive) representations, 
or altruism. Empathy for pain is currently a topic 
of much study in humans (/9—2/), and “mirror 
neurons” responding to another’s pain may have 
been identified in human anterior cingulate 
cortex (22). A large human literature documents 
the effects on pain report of observation of pain 
in others (23); the present data suggest that 
these effects may be mediated precognitively. 
There are clear limitations to the mechanistic 
information that can be gleaned from human 
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studies; the availability of an animal model of 
empathy will allow the application of far more 
powerful experimental techniques. 
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“ Science 
ii Looks at Life 


WHAT HAPPENS WHEN RESEARCHERS APPLY THE DISCIPLINE OF SCIENCE TO THE RICH 
—often chaotic—ferment of our lives? This issue provides a sampling of provocative 
insights by social scientists studying humans at different stages of the life cycle. We thank 
Robert Axelrod, William Butz, and Barbara Torrey, who aided us in this project, part of 
an ongoing program of encouraging the best in the social sciences. 

Where do the data come from? Butz and Torrey (p. 1898) introduce the special section 
by describing new technologies and some of the challenges that remain in their application. 
Horizons are being opened through use of the Internet, geographical information systems, 
and biomarkers. Genetics is providing new insights into the etiology of our political views, 
according to Kinder (p. 1905), who describes evidence that we are hard-wired for some 
of our political tendencies, although complex changes occur throughout life. 

Helping children grow and develop is something that cuts to the heart of every 
community. Heckman (p. 1900) believes that the United States is not spending its child 
education money wisely—that we spend too little at critical points in child development 
before kindergarten and too much in “second-chance” efforts when interventions 
have become costly and ineffective. Although Richter (p. 1902) also considers early 
involvement to be crucial, she notes that even children who have lived in poverty and 
witnessed or experienced violence in South Africa often find whatever opportunities 
are available to become healthy, caring members of society. 

What is important in life, and would we know it if we saw it? Measuring happiness or 
satisfaction is not the same as measuring height; it changes depending on what we are 
focusing on at the time (Kahneman et al., p. 1908). Our perceptions also tend to change as 
we grow older (or have other reasons to feel that the time left to us is shrinking and finite); 
this can lead us to devote more energy to strengthening current bonds and less to striking 
out on new adventures (Carstensen, p. 1913). We are seeing a shift in the proportions of 
young and old in many parts of the world. A News story by Balter (p. 1894) explores 
declines in human fertility and how fretful governments are responding. Vaupel and 
Loichinger (p. 1911), in their analysis of changing age demographics in Europe, describe 
a way to change the balance between family and work that will strike a responsive chord 
in many working parents. 

Without insights from the social sciences, technological breakthroughs may never be 
translated into real-world solutions, as described in the Editorial by Lane (p. 1847). 
Social sciences are integrated into many parts of Science, in addition to research, as in 
ScienceCareers.org, which has a portal devoted to social, behavioral, and economic sciences. 
This week, Science Careers is focusing on the decision, risk, and management sciences. 
A major avenue of scholarship for social scientists is the book, and the Books section 
(p. 1876) contains reviews of three political science—related offerings. As a further 
resource, readers should note the Science of Aging Knowledge Environment (SAGE KE), 
which has for the past 5 years provided a variety of views on aging, including a Perspective 
this week on changing work patterns (Levine et al.). 

Rarely has a group of papers been so engaging and stimulating as this foray into the 
frontiers of the social sciences. We hope the experience will be one of many. 

—BARBARA JASNY, DONALD KENNEDY, ELIOT MARSHALL 
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NEWS 


The Baby Deficit 


As fertility rates decline across the developed world, governments are 
offering big incentives for childbearing. Experts don’t expect them to 


have much effect 


Last month, from the podium of the Kremlin’s 
grandiose Marble Hall, Russian President 
Vladimir Putin expounded on subjects vital to his 
nation’s future—economic growth, technological 
modernization, and world trade—then he turned 
to the “most important” matter. “What I want to 
talk about,” Putin said in his annual speech before 
the Federal Assembly, “is love, women, children. 
I want to talk about the family, about the most 
acute problem facing our country today—the 
demographic problem.” Reminding the deputies 
that Russia’s 143-million-strong population was 
declining by almost 700,000 people each year, 
Putin proposed a fistful of incentives to boost the 
country’s flagging birthrate. They include raising 
the childcare benefit of 700 rubles ($26) per 
month to 1500 rubles for a first child and 
3000 rubles for a second child, and paying 
18 months of maternity leave equal to at least 
40% of a mother’s previous wages. 

Putin is not the only politician talking about 
babies these days. Earlier this year, Poland’s 
Parliament approved a one-time payment of 
1000 zlotys ($328) for each child born, and this 
month, German Chancellor Angela Merkel 
proposed a l-year paid leave for women who 
have children. When Australia introduced its 
own generous “baby bonus” in 2004, the coun- 


try’s treasurer Peter Costello exhorted parents 
to have “one for Mum, one for Dad, and one for 
the country.” On 1 July, Australia’s bonus will 
jump from $2250 to $3002 per child (in U.S. 
dollars) and will reach $3762 by 2008. Mean- 
while, pro-family inducements have been in 
place for many years in France, Sweden, and 
other European countries. 

Political leaders and economists see plenty 
of justification for spending all this money. In 


“The popularity of baby- 
bonus schemes among 
governments is difficult 


to understand.” 
—Anne Gauthier, 
University of Calgary 


the European Union (E.U.), for example, low 
birthrates have already begun to shrink the 
population, and demographers project that the 
E.U. will lose between 24 million and 40 million 
people during each coming decade unless 
fertility is markedly raised (Science, 28 March 
2003, p. 1991). Population losses could bring a 


raft of negative economic consequences in the 
industrialized world, as well as greater stresses 
on social security and health care systems as 
the proportion of older citizens increases. “The 
changes projected for the United States are not 
as dramatic as those projected for other areas— 
particularly Europe and Japan—but they 
nonetheless present substantial challenges,” 
then—Federal Reserve Board chair Alan 
Greenspan told a 2004 symposium on popula- 
tion aging in Jackson Hole, Wyoming. 

Although these trends are most pronounced 
in the developed world, fertility declines are now 
also being detected even in less affluent areas of 
Latin America and Asia. Roughly half of the 
world’s nations, with more than 40% of the 
human population, now have birthrates below 
replacement levels, and fertility rates are falling 
steadily in most developing countries as well. To 
be sure, demographers predict that the world’s 
population will continue to increase for decades 
to come, rising from its current 6.5 billion to 
somewhere between 8 billion and 11 billion by 
2050 (see sidebar, p. 1896). But nearly all of this 
increase will be in developing countries. 

Population researchers nevertheless are 
currently engaged in a lively debate over just 
what, if anything, developed countries can do 
to increase family size. Some believe very 
low fertility rates are here to stay. “The popu- 
larity of baby-bonus schemes among govern- 
ments is difficult to understand,” says Anne 
Gauthier, a sociologist at the University of 
Calgary in Canada. “While the additional 
financial support is bound to be welcomed by 
parents, the overall effect on fertility is likely 
to be small.” 
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Doing her part. Minister for Families Ursula von 
der Leyen (with her children) oversees Germany's 
effort to increase fertility rates. 


Others argue that even modest boosts in the 
birthrate can make a difference. “We can only 
expect relatively small effects of policy on fertil- 
ity, but relatively small effects are important 
when fertility is low,” says demographer Peter 
McDonald of the Australian National University 
(ANU) in Canberra, whose advocacy of pro- 
family policies helped bring about Australia’s 
baby bonus. Yet both sides agree that falling fer- 
tility rates might be irreversible once they drop 
below a certain level—what some demographers 
have begun to call the “low-fertility trap.” 


The demographic transition 

Predicting population trends is a tricky busi- 
ness, fraught with assumptions about what 
humans are likely to do in the future. Most 
demographers rely on a complex parameter 
called the total fertility rate (TFR). For any par- 
ticular country and year, the TFR is a hypotheti- 
cal measure of the average number of children 
that nation’s women would bear during their 
lifetimes if, at each stage of their lives, they 
behaved exactly like women in each age group 
did during that year. By comparing TFRs from 
one year to another, demographers can track fer- 
tility trends. Leaving aside the effects of immi- 
gration and emigration, if a population is to 
remain the same size, both parents must replace 
themselves. For industrialized countries, 
demographers define a replacement-level TFR 
as 2.1—slightly more than a flat rate, to account 
for the small fraction of children who die before 
reaching reproductive age. 

Yet nearly all of the world’s industrialized 
nations have TFRs well below this magic 
number. Russia’s current TFR is only 
1.28 (which ties it with Italy and 
Spain), Poland’s is 1.25, Germany’s 
is 1.39, and Australia’s is 1.76, which 
helps explain the alarm expressed 
by political leaders in those 
countries. Even the E.U. nations 
with the highest birthrates, France 
and Ireland, are falling short of 
replacement, with TFRs of 1.84 
and 1.86, respectively. Nor is the 
baby shortage restricted to Europe: 

South Korea’s TFR is 1.27 and 
Japan’s is 1.25. Only the United 
States, exceptional in the developed 
world, hits the replacement mark, with 
a TFR of 2.09. 

Today’s low TFRs are an unexpected 
consequence of a so-called demographic 
transition to lower fertility rates that began in 
Europe in about 1800 and is still taking place in 
much of the world. As advances in health and 


Proud papa. Australian Treasurer Peter Costello fathered a baby- 
bonus scheme. 


hygiene increased the likelihood of a child sur- 
viving to reproduce, both death and birthrates 
started to fall, especially in industrialized coun- 
tries. Although TFRs remain high in some of the 
world’s poorest countries—Niger has the high- 
est TFR, 7.46—the demographic transition is 
either under way or completed in most nations. 
The process has taken place even in relatively 
poor countries such as Mexico, where TFR 
dropped from 6.5 to 2.5 between 1975 and 2005, 
and the Philippines, which saw a decline from 
6.0 to 3.2 during the same period. However, 
demographers had assumed that the decline 
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would stop when replacement-level 
TFRs were reached. “During the 
early 1970s, everyone talked about 
the magic floor of replacement,” 
says David Reher, a population 
historian at the Complutense Uni- 
versity of Madrid, Spain. “Nobody 
thought it would go below 2.1.” 

Yet by 1975, several European 
countries, as well as the United 
States and Canada, had already 
dipped below this floor. (Although 
the United States has now come 
back up to replacement level, 
Canada’s TFR has continued to 
plummet and now stands at 1.61.) 
This trend, which many demogra- 
phers and economists call the 
“second demographic transition,” 
has its roots in the social changes 
that swept much of the Western 
world during the 1960s and 1970s. 
As women entered the labor force in increasing 
numbers and obtained easier access to effective 
contraception and as conflicts between work 
and childbearing intensified, parents began to 
delay the timing of their first child, which 
inevitably led to a reduction in the total number 
of offspring. These shifts were accompanied by 
a constellation of new attitudes toward family, 
career, and personal autonomy that are not eas- 
ily quantified, researchers say. “Human repro- 
ductive behavior is profoundly social,” says 
Jennifer Johnson-Hanks, a demographer and 
anthropologist at the University of California, 
Berkeley. “It is structured by social categories, 

value systems, and power relations.” John 

Bongaarts, a demographer at the Popu- 
lation Council in New York City, adds 
that personal choice has come to 
play a much bigger role in repro- 
ductive decisions. In earlier 
days, Bongaarts says, “people 
tended to do what society 
expected of them. Over 
time, individual agency has 
become more important.” 
Social factors also 
explain the United States’s 
anomalously high fertility 
rate, population experts say. 
Although relatively higher 
birthrates among some ethnic 
groups and more recently 
arrived immigrants, including 
Hispanics, explain part of the 
difference, the TFR for non- 
Hispanic whites is still about 


_ end 1.85, equivalent to the highest 
| i: . rates seen in Europe. “There 


are several factors that make 
the TFR in the US. higher than 
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Life Cycles 


in many European countries,” Bongaarts says, 
including a higher rate of unwanted pregnan- 
cies due to restrictions on birth-control infor- 
mation, a lower unemployment rate, and a 
greater tendency for women to have children 
earlier in life than in Europe. Gauthier adds 
that a stronger emphasis on religion and “tra- 
ditional values” in the United States also 
tends to favor larger families. 


Aged and dependent 

The key reason that economists and other 
experts are worried about low fertility rates is 
that they accelerate an overall “aging” of a pop- 
ulation, in which the proportion of elderly adults 
relative to the active labor force increases. The 
consequences of an increase in this so-called 
dependency ratio are hard to predict, says 
demographer James Trussell of Princeton Uni- 
versity. “The economic burden of the elderly 
will depend on their health, on employment 
opportunities, and on the social institutions that 
support their care,” Trussell says. “But it is clear 
that it will be a challenge.” One way that many 
developed countries meet the challenge now is 
through immigration, which tends to increase 
the number of younger workers. Yet few demo- 
graphers see immigration as the answer. 

“As a short-term solution, it is necessary, and 
it is happening,” says Reher. “But there are very 
serious doubts about whether it is a long-term 
solution. Migrant fertility starts higher than that 
of the native population but very quickly 
descends towards local fertility levels.” Trussell 
agrees: “To have an appreciable effect on the 
aging of a population, you would need massive 
immigration, which is not politically acceptable 
in either Europe or the U.S.” 

That leaves raising birthrates as the only solu- 
tion, assuming that a solution to low fertility rates 
is possible—and desired. Some demographers 
take heart in an apparent gap between how many 
children parents would ideally like to have if they 
felt they could manage it and how many they 
actually do have. In this gap, some see wiggle 
room for fertility-enhancing policies. Thus, 
public-opinion surveys carried out by the E.U. as 
part of its Eurobarometer program have sug- 
gested that this gap amounts to an average of 
about 0.5 children per woman. Indeed, baby 
bonuses and other pro-family measures are in 
part designed to make it easier financially for 
families to fulfill this ideal. But Gauthier questions 
whether the gap is actually that large. In a study in 
press at the journal Population Research and 
Policy Review, she concludes that the “window of 
opportunity” for family policies might actually be 
as little as 0.1 to 0.2 children per woman. 

Gauthier and other researchers agree never- 
theless that pro-family policies have had some 
positive effect on fertility rates in countries such 
as France, whose TFR of 1.84 is the second high- 


The Bomb That Wasn't 


When Stanford University entomologist Paul Ehrlich published The Population Bomb in 1968, the 
world’s human population was about 3.5 billion. Today, it is approximately 6.5 billion. Yet the worst 
of Ehrlich’s widely publicized predictions, including the starvation of hundreds of millions of people 
in mass famines, have not come true. Still, the world’s population is expected to continue to grow 
until at least 2050, according to estimates by the United Nations Population Division 
(esa.un.org/unpp). Just how much it will increase depends on future fertility, which is very difficult to 
predict. U.N. population experts have examined three hypothetical fertility trends, which they term 
medium, low, and high. Under the medium scenario, population would reach 9.1 billion by 2050, 
but the low and high scenarios project as few as 7.6 billion people and as many as 10.6 billion. 

Nearly all of this growth will be in developing countries, with major contributions from nations 
such as India, Pakistan, Nigeria, Bangladesh, and China. (Even the United States, with its relatively 
youthful population, will add significant numbers.) Fueled by very high fertility rates, between now 
and 2050, population is expected to at least triple in some nations, such as Afghanistan, Burundi, 
Chad, Democratic Republic of Congo, Mali, and Uganda—despite high HIV infection rates in 
many African countries. Yet over the long term, fertility is expected to drop dramatically in even the 
poorest countries, from an average of five children per woman now to about 2.6 in 2050; and under 
the U.N.’s medium scenario, average worldwide fertility will decline to 2.05 by 2050, and to just 
over 1.5 in the low scenario, well below the replacement level. 

“Virtually all countries are headed towards replacement-level fertility or below,” says Ronald 
Lee, a demographer at the University of California, Berkeley. “But there may be pauses and 
reversals along the way, sometimes lasting decades.” If so, the population bomb may ultimately 
fizzle out—that is, assuming an already stressed planet can survive the onslaught of 9 billion 


human beings. 


est in the E.U. after Ireland. “There are no fewer 
than 38 measures in favor of families with chil- 
dren,” says demographer Laurent Toulemon of 
the National Institute of Demographic Studies in 
Paris. For example, mothers receive 16 weeks of 
maternity leave at more than 80% of their 
normal pay, which is extended to 26 weeks 
beginning with a third child. Parents also receive 
numerous direct allowances to help provide for 
young children, and the number of publicly 


Allons les enfants! France guarantees nursery 
school spots to nearly all children. 


-M.B. 


funded nursery schools has expanded in recent 
years to the extent that nearly every child is 
guaranteed a place. In fact, there are so many 
pro-family policies, says Toulemon, “that it is 
almost impossible to evaluate the impact of each 
one” on fertility. 

Despite these generous allotments, however, 
France’s relatively high fertility rate in European 
terms is still below replacement. The same is true 
of Sweden, where government officials credit 
bountiful policies designed to make life easier 
for working parents with recent gains in TFR 
from about 1.6 to 1.8. Yet Gigi Santow, formerly 
of Stockholm University and now an independ- 
ent demographer in Sydney, Australia, says that 
this fertility jump was not due to baby bonuses or 
other direct attempts to create a baby boom. 
“Swedish fertility rates may well have responded 
to the government’s integrated web of cradle- 
to-grave social policies,” Santow says. She adds 
that fertility plummeted during the economic 
recession that hit Sweden during the 1990s, 
despite the policies then in place. 

Proving that financial incentives can actually 
raise fertility rates is very difficult—and demog- 
raphers do not always agree. “We cannot carry 
out an experiment,” says Gauthier. “We can only 
look historically at what has happened and rely 
on cross-national differences in policies.” Earlier 
this month, for example, Australia’s news media 
were abuzz with reports of the latest birth figures 
from the Australian Bureau of Statistics, show- 
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ing that 261,404 babies were born in 2005, 
2.4% more than the previous year and the 
highest number since 1992. Treasurer Costello 
was quick to credit the baby bonus: The daily 
newspaper The Australian quoted Costello as 
“delighted that at least some families have been 
taking up the challenge.” 

ANU’s McDonald says that although it is 
too early to carry out “rigorous research” on the 
reasons for the increase, most of the additional 
births are to women in the middle to late part of 
their childbearing years. This suggests that the 
message may have been heard: “If you want to 
have children, it is risky to delay too long,” 
McDonald says. And although McDonald 
concedes that “most of the 261,000 women 
who gave birth in 2005 would have had the baby 
without” the baby bonus, the extra money “can 
make a difference” to middle income families 
who make “close calculations” about the impact 
of parenting. McDonald estimates that 
Australia’s TFR for 2005, when published in 
November, should rise from about 1.76 to 1.82. 

But Robert Birrell, director of the Center for 
Population and Urban Research at Monash Uni- 
versity in Clayton, Australia, says that a number 
of other factors may have weighed much more 
heavily, especially “the impact of the current 
economic boom in Australia, which has seen an 
increase in the rate of employment for men and 
particularly women in recent years.” Santow 
agrees: “I would not leap immediately to the 
conclusion that Peter Costello should be given 
the credit.” 


Low-fertility spiral 

The uncertain response to incentives suggests 
to some demographers that governments need 
to do even more to make child rearing attrac- 
tive. “Many things that we’ve tried aren’t big 
enough,” says Bongaarts. ““To move behavior, 
you need real incentives; you need thousands 
of dollars. ... You have to pull all the levers 
you have, and maybe you will get halfway 
there.” But pulling those levers might end up 
being too costly, Trussell says. “Policies that 
would work would be so expensive that they 
will never be implemented.” 

And some researchers have begun to think 
that it might actually be too late to reverse the 
trend in countries with the lowest fertility levels. 
At several recent population meetings, for 
example, McDonald has warned that once a 
nation’s TFR falls below 1.5, a downward 
demographic spiral sets in that makes it much 
more difficult to recover. “This is the safety 
zone,’ McDonald says. “Countries should try 
hard to avoid falling below it.” 

A team led by Wolfgang Lutz, a demographer 
at the International Institute for Applied Systems 
Analysis (IIASA) in Laxenburg, Austria, has 
taken McDonald’s observation further and 


argued that countries with a TFR of 1.5 or lower 
may have crossed into permanent negative pop- 
ulation growth. Lutz calls this hypothesis, which 
he presented most recently at this spring’s 
annual meeting of the Population Association of 
America in Los Angeles, California, the “low- 
fertility trap.” Lutz and other colleagues at 
IIASA and the Vienna Institute of Demography 
argue that the new social norms created by low 
fertility rates create a self-reinforcing negative 
feedback loop. It is locked in place by a reduc- 
tion in ideal family size, aging of the population, 
and other effects on the labor market that make 
having fewer and fewer children inevitable. As 
evidence, Lutz and his colleagues cite data 
from the Eurobarometer survey showing that in 
Germany and Austria—nations with TFRs of 
1.39 and 1.36, respectively—young adults now 
consider their ideal family sizes to be as low as 
1.7 children on average. 

“Germany is the extreme example of this 
phenomenon, with around 30% of young people 
not intending to have children,” says McDonald. 


Trends in Total Fertility Rates 
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Reher argued that low fertility rates were now 
entrenched in the social structure of developed 
countries and a growing number of developing 
countries as well. Although the momentum of 
past high fertility rates would continue to fuel an 
increase in the entire world’s population for 
some decades to come, this would eventually 
stop. Rather, Reher maintained, much of the 
world is now on the cusp ofa prolonged period of 
population decline. The resulting population 
aging would lead to labor shortages even in 
developing countries. The result could be an eco- 
nomic disaster, Reher warned. “Urban areas in 
regions like Europe could well be filled with 
empty buildings and crumbling infrastructures 
as population and tax revenues decline,” he 
prognosticated, adding that “it is not difficult to 
imagine enclaves of rich, fiercely guarded 
pockets of well-being surrounded by large areas 
which look more like what we might see in some 
science-fiction movies.” 

Most population researchers agree that 
there is plenty to worry about in current world- 
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On the other hand, McDonald does not agree that 
there is no turning back for countries whose 
TFRs fall this low: “This does make Germany a 
tougher nut to crack, but I would never declare 
the game as over.” 

Yet Reher sees little reason for optimism. 
“When fertility is drastically below replacement, 
it doesn’t go up, no matter how many policies 
and how much money is thrown at it,” he says. 
“We are in the midst of a cascading fertility 
decline. Even a TFR of 1.7 is not safe; it is a 
disaster if you look a couple of generations 
down the line.” 

Indeed, Reher, at the July 2005 annual meet- 
ing of the International Union for the Scientific 
Study of Population in Tours, France, presented a 
paper suggesting an even more dismal picture. 


1985 
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wide demographic trends. Yet few are ready 
to accept the direst parts of Reher’s doomsday 
scenario—at least not yet. “I wouldn’t be 
surprised” if population shrinkage “happens in 
a lot of places in the world,” says Gauthier, 
although she adds that “it is much harder to 
believe in Africa,” where the population is 
expected to at least double by 2050. And Santow 
comments that although Reher’s predictions 
“may well be sensible,” she sees “nothing 
terrifying about a drop in the size of Europe’s 
population. Any decline will take time, and 
economies will adjust. Governments should 
not expend energy to maintain the status quo. 
Governments should plan for the future, not try 
to reintroduce the past.” 

—-MICHAEL BALTER 
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LIFE CYCLES 


PERSPECTIVE 


Some Frontiers in Social Science 


William P. Butz* and Barbara Boyle Torrey 


The fundamental challenge in the social sciences is moving from complicated correlations to useful 
prediction. Progress usually reflects an interplay between theory, data, and tools. Six areas of 
innovation, principally data and tools, are now pushing at the frontiers of these sciences: 
longitudinal data, laboratory experimentation, improved statistical methods, geographic informa- 
tion tools, biosocial science, and international replication. These innovations are gaining power as 
they cross disciplinary boundaries, helping to attribute causality to observed relationships, to 
understand their nature, and thereby to improve the accuracy and usefulness of predictions. 


sciences is moving from complicated 

correlations to useful prediction (/). The 
variance in human behavior and social adapta- 
tions is so large that it becomes difficult to dis- 
tinguish signals from noise in data, even where 
theory supplies clear guidance. However, new 
data, methodologies, and tools from both inside 
and outside the social sciences are demonstrat- 
ing real promise in advancing these sciences 
from descriptive to predictive ones (2). 


T= fundamental challenge in the social 


Longitudinal Surveys 

Longitudinal surveys, which collect information 
about the same persons over many years, have 
given the social sciences their Hubble 
telescope. Both allow the observing 
researcher to look back in time and 
record the antecedents of current 
events and transitions. These lon- 
gitudinal data sets, beginning 40 
years ago with the Panel Study of 
Income Dynamics (3), document 
the importance of accumulated life 
experience in causing transitions 
from health to infirmity; from work 
to unemployment or retirement; 
and across states of marriage, fam- 
ily structure, and wealth. 

Similar longitudinal surveys in 
different countries are beginning 
to reveal the institutional and cul- 
tural variations in these life cycle 
dynamics (4). A new investment in 
international longitudinal surveys 
on aging may eventually unravel 
the changing dynamics at the end of the life 
cycle across time and countries (5). One recent 
comparison showed that health improved with 
socio-economic status in both England and the 
United States. It also showed, however, that 
English health status is better than in the United 
States at each socio-economic level (6). The 
challenge now is to use the accumulated lon- 
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gitudinal data on individuals to explain these 
differences in health. 


Laboratory Experimentation 


The new field of behavioral social science in- 
tegrates insights from across the social and 
behavioral sciences with data from the neuro- 
sciences to better understand economic, social, 
and political outcomes (7). An important tool of 
behavioral social science is controlled laboratory 
experimentation, usually conducted at computers 
where subjects respond to specific instructions 
and online stimuli. Game theory, risk and deci- 
sion science, and experimental social psychology 
have provided most of the theory and methods 


Fig. 1. (© 2000) Jack Zielgler/The New Yorker Collection/Cartoonbank.com 


for this new approach (8). The objects of inquiry 
span all the social sciences, including the origins 
and impacts of ethnic conflict (9), group and 
team behavior in organizations (/0), and the 
nature and consequences of trust and reciprocity 
in interpersonal and international relations. By 
manipulating participants’ instructions, incen- 
tives, constraints, and rewards in an experimental 
setting, investigators can fine-tune the parame- 
ters of real-world applications. Such applications 
are by now diverse and their use has spread 
around the globe: broadband spectrum auctions 
(1), tradable permits to pollute (/2), airport 


landing rights algorithms, voucher plans for 
schools, and income-contingent loans for higher 
education. One current challenge among many is 
designing mechanisms to induce development 
and distribution of vaccines for neglected dis- 
eases that affect poor countries (/3). 

More generally, research is now testing 
whether behavior in a controlled game indeed 
predicts future real-world behavior of the in- 
dividuals who played the game (/4), whether 
such behavior corresponds to evidence from 
independent sample surveys, and whether field 
experiments outside the laboratory support the 
findings (/5). As Fig. 1 illustrates, better to ex- 
periment with different settings of the “life 
machine” in the laboratory before imposing 
them through organizations and governments on 
the real lives of people. 


Improved Statistical Methods 


The Internet provides ready and inexpensive 
access around the world to hundreds of thou- 
sands of potential survey respondents, many 
more than even the largest conventional survey 
can include (6). Such respondents could also be 
experimental subjects, reacting to the experimen- 
tal design in real time and in unprecedented 
numbers. These respondents could be preselected 
as a controlled sample, but greater potential lies 
in granting easy access to anyone 
who wants to participate, resulting 
in huge “convenience samples” 
that do not represent any known 
larger population. The challenge, to 
draw proper inferences to a uni- 
verse larger and more interesting 
than just the volunteer survey re- 
spondents, has been well known for 
decades in the context of program- 
and clinic-based data. What is 
different now is the suddenly large 
payoff from solving this challenge 
of respondent selection bias (77). A 
promising approach is to reweight 
the data based on the known distri- 
bution of some key variables that 
are thought to capture the differ- 
ence between Web survey respon- 
dents and the target population. The 
idea is that the weights used to 
correct for the discrepancy to the known 
distribution of the key variables will also remove 
or reduce bias for other variables in the survey 
(8, 19). Another approach is to test theories on 
multiple independent Web samples (20). 

There is a debilitating tradeoff between 
the power of richly detailed data on individuals 
(and firms and other organizations) to test im- 
portant hypotheses, on the one hand, and the 
possibility that such detailed information will 
threaten the privacy and confidentiality of the 
persons and organizations described in the data, 
on the other hand. The masking of subject iden- 
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tity arises in many fields of science but is par- 
ticularly vexing in the social sciences where 
available masking techniques necessarily reduce 
the amount of information in the data. Exact- 
matched data files from multiple surveys and 
administrative record sources are particularly 
valuable analytically, but also particularly at risk 
for disclosure. Consider, for example, matched 
files that contain demographic and attitudinal in- 
formation from persons in surveys, economic 
information on these same persons from govern- 
ment records, and medical information from 
providers. When longitudinal data are so matched, 
the analytical payoff can be unprecedented at 
relatively little cost in time and money. 

Some researchers now have increased access 
to such data through cumbersome administrative 
and legal arrangements, such as traveling to 
special protected facilities or submitting statistical 
software to be run in a protected facility (2/, 22). 
These challenges arise particularly but not exclu- 
sively with data files collected by the federal 
government. Statistical methods of masking 
subject identities while retaining sufficient infor- 
mation for analysis are under development (23). 
Breakthroughs here will have substantial scien- 
tific payoff. 


Geographic Information Tools 


The context of human social interaction has 
always been critical for understanding social out- 
comes. The integration of Geographic Information 
Science (GIS) and Geographic Positioning Sys- 
tems (GPS) with social science is beginning to 
socialize the pixel by providing a geographic con- 
text for social behavior (24). These new tools pro- 
vide the ability to analyze social behavior across 
time and geographic scales, although their adop- 
tion by social scientists has yet to approach their 
potential. The integration of geography and macro- 
economics will allow researchers to estimate how 
much of the regional differences in per capita output 
and output per geographic area is due to country- 
specific factors, such as institutional differences, 
and how much to other factors, such as geography 
(25). This integration may eventually disentangle 
the explanations for why some countries and re- 
gions within countries lag economically. In ad- 
dition, the global distribution of population by 
elevation will provide future estimates of coastal 
and biological hazards as well as insights into hu- 
man settlement patterns (26). Spatial integration of 
population change and consumption patterns may 
also become increasingly important in understand- 
ing environmental and climate changes (27). 


Biosocial Science 


No field outside the social sciences is having as 
much of an impact within the field as biology. 
This new biosocial science may well alter under- 
standings about sexual orientation, criminal re- 
sponsibility, prospects for marriage as a social 
institution, and even the nature of moral obligation. 


Mapping of the human genome has contributed 
fundamentally to understanding how the human 
body has evolved, how it operates, and how it 
malfunctions. Genomics has also provided the tools 
to understand important social science phenomena 
such as prehistoric human migrations (28). The 
International Hap Map project will go much further 
in exploring the nature of genetic variation among 
human beings. This project is challenging the racial 
and ethnic constructs that much of social science 
has used from its beginnings (29). Although de- 
terminants of human behavior are both genetic and 
environmental, biosocial research may also alter 
our understanding of their relative impacts. 

Neurosocial science uses neuroimaging to 
study the origins and nature of economic, social, 
and political behavior, examining such phenome- 
na as decision-making, empathy, and time pref- 
erence. The work is mainly descriptive so far but 
is beginning to suggest fundamental distinctions 
between behaviors that have been assumed to 
have identical causes and characteristics. For 
example, two separate neural systems are in- 
volved in decisions between monetary award op- 
tions, depending on time to delivery (30). There is 
also evidence for a neurobiological link between 
the experienced displeasure of dread and subse- 
quent decisions about unpleasant outcomes (37). 

Social scientists are also collaborating with 
animal behaviorists to test the species general- 
ity of hypotheses originally generated to ex- 
plain human behavior. The observations that 
animals can count, plan ahead, and make de- 
cisions under uncertainty, and that most other 
primate females also live longer than males, 
call into question some long-standing beliefs 
about “human” behavior (32). 

Field surveys of economic, social, and health 
phenomena are increasingly incorporating bio- 
markers collected from the survey respondents, 
in order to study biosocial interactions in fer- 
tility, work, health, and aging (33). Indeed, bio- 
markers on social surveys may well reveal more 
about subjects’ predispositions and their ances- 
try than do their verbal responses on which so- 
cial scientists have historically depended. 

Over the past two decades, the theory of evo- 
lution has influenced parts of economics and 
psychology, and to a lesser extent sociology, an- 
thropology, and political science (34, 35). Where 
do our behaviors and their reflections in our 
organizations (including the family) come from? 
What constraints on our “freedom” do our hard- 
wired tendencies impose? Are there cross-cultural 
differences? What refutable predictions does an 
evolutionary approach imply that can be tested 
against other theories? What are the practical and 
policy implications? These questions, largely 
unanswered, are fundamentally important. 


International Replications 


International replication of national findings is 
critical for distinguishing between local and 
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universal phenomena. The recent development of 
internationally harmonized cross-sectional data sets 
has already demonstrated their potential to generate 
hypotheses for future research. The Demographic 
and Health Surveys have documented fertility 
declines in developing countries, but have raised 
major questions about why the patterns of decline 
vary so much (36). The Luxembourg Income 
Study has shown the income vulnerability of 
single-parent families in developed countries but 
has raised questions about why some countries are 
so much more efficient at compensating for the 
vulnerability than others (37). The World Value 
Survey has documented the trends toward secu- 
larism in developed countries over the past two 
decades, which has raised questions about why 
the United States is an exception (38). Some 
future breakthroughs in understanding social dy- 
namics will begin with questions raised by these 
kinds of cross-sectional comparisons. 


Conclusions 


Progress in the social sciences usually reflects an 
interplay between ideas, data, and tools. In our 
view, this interplay at the frontiers has in recent 
decades been driven more by new data, tools, 
and methods and less by theoretical advances 
and novel hypotheses, although in some areas 
these have been plentiful. Many of these inno- 
vations have gained power as they crossed dis- 
ciplinary boundaries, some from outside the 
social sciences entirely. Those discussed here 
have more potential than has been realized so far 
to attribute causality to observed relationships, to 
understand their nature, and thereby to improve 
the accuracy and usefulness of predictions. 
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PERSPECTIVE 


Skill Formation and the Economics of 
Investing in Disadvantaged Children 


James J. Heckman 


This paper summarizes evidence on the effects of early environments on child, adolescent, and 
adult achievement. Life cycle skill formation is a dynamic process in which early inputs strongly 


affect the productivity of later inputs. 


our core concepts important to devising 
sound social policy toward early childhood 
have emerged from decades of independent 
research in economics, neuroscience, and develop- 
mental psychology (/). First, the architecture of the 
brain and the process of skill formation are 
influenced by an interaction between genetics and 
individual experience. Second, the mastery of skills 
that are essential for economic success and the 
development of their underlying neural pathways 
follow hierarchical rules. Later attainments build 
on foundations that are laid down earlier. Third, 
cognitive, linguistic, social, and emotional com- 
petencies are interdependent; all are shaped power- 
fully by the experiences of the developing child; 
and all contribute to success in the society at large. 
Fourth, although adaptation continues throughout 
life, human abilities are formed in a predictable 
sequence of sensitive periods, during which the 
development of specific neural circuits and the be- 
haviors they mediate are most plastic and therefore 
optimally receptive to environmental influences. 
A landmark study concluded that “virtually 
every aspect of early human development, from 
the brain’s evolving circuitry to the child’s ca- 
pacity for empathy, is affected by the environ- 
ments and experiences that are encountered in a 
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cumulative fashion, beginning in the prenatal pe- 
riod and extending throughout the early child- 
hood years” (2). This principle stems from two 
characteristics that are intrinsic to the nature of 
learning: (1) early learning confers value on ac- 
quired skills, which leads to self-reinforcing moti- 
vation to learn more, and (ii) early mastery of a 
range of cognitive, social, and emotional compe- 
tencies makes learning at later ages more efficient 
and therefore easier and more likely to continue. 


Early family environments are major predic- 
tors of cognitive and noncognitive abilities. 
Research has documented the early (by ages 4 
to 6) emergence and persistence of gaps in cog- 
nitive and noncognitive skills (3, 4). Environ- 
ments that do not stimulate the young and fail to 
cultivate these skills at early ages place children 
at an early disadvantage. Disadvantage arises 
more from lack of cognitive and noncognitive 
stimulation given to young children than simply 
from the lack of financial resources. 

This is a source of concern because family 
environments have deteriorated. More U.S. chil- 
dren are born to teenage mothers or are living in 
single parent homes compared with 40 years ago 
(5). Disadvantage is associated with poor parent- 
ing practices and lack of positive cognitive and 
noncognitive stimulation. A child who falls be- 
hind may never catch up. The track records for 
criminal rehabilitation, adult literacy, and public job 
training programs for disadvantaged young adults 
are remarkably poor (3). Disadvantaged early en- 


65 
Highest income quartile 
60 } 
2 
= asl Third income quartile 
8 
@ 50 
a 
co) 
5 — 
a — Second income quartile 
40> ane 
Lowest income quartile 
35 : . - 


10 12 


Age of child 


Fig. 1. Average percentile rank on Peabody Individual Achievement Test—Math score by age and income 
quartile. Income quartiles are computed from average family income between the ages of 6 and 10. 


Adapted from (3) with permission from MIT Press. 
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vironments are powerful predictors of adult failure 
on a number of social and economic measures. 

Many major economic and social problems 
can be traced to low levels of skill and ability in the 
population. The U.S. will add many fewer college 
graduates to its workforce in the next 20 years than 
it did in the past 20 years (6, 7). The high school 
dropout rate, properly measured with inclusion of 
individuals who have received general educational 
development (GED) degrees, is increasing at a 
time when the economic return of schooling has 
increased (8). It is not solely a phenomenon of 
unskilled immigrants. Over 20% of the USS. 
workforce is functionally illiterate, compared with 
about 10% in Germany and Sweden (9). Violent 
crime and property crime levels remain high, 
despite large declines in recent years. It is 
estimated that the net cost of crime 
in American society is $1.3 trillion 
per year, with a per capita cost of 
$4818 per year (10). Recent research 
documents the importance of deficits 
in cognitive and noncognitive skills 
in explaining these and other social 
pathologies (//). 


Noncognitive Skills and Examples 
of Successful Early Interventions 


Cognitive skills are important, but 
noncognitive skills such as motiva- 
tion, perseverance, and tenacity are 
also important for success in life. 
Much public policy, such as the No 
Child Left Behind Act, focuses on 
cognitive test score outcomes to 
measure the success of interventions 
in spite of the evidence on the im- 
portance of noncognitive skills in 
social success. Head Start was deemed 
a failure in the 1960s because it did 
not raise the intelligence quotients 
(IQs) of its participants (/2). Such 
judgments are common but miss the 
larger picture. Consider the Perry 
Preschool Program (/3), a 2-year 
experimental intervention for disad- 
vantaged African-American children 
initially ages 3 to 4 that involved morning pro- 
grams at school and afternoon visits by the teacher 
to the child’s home. The Perry intervention group 
had IQ scores no higher than the control group by 
age 10. Yet, the Perry treatment children had higher 
achievement test scores than the control children 
because they were more motivated to learn. In 
followups to age 40, the treated group had higher 
rates of high school graduation, higher salaries, 
higher percentages of home ownership, lower 
rates of receipt of welfare assistance as adults, 
fewer out-of-wedlock births, and fewer arrests 
than the controls (73). The economic benefits of 
the Perry Program are substantial (Table 1). Rates 
of return are 15 to 17% (/4). (The rate of return is 
the increment in earnings and other outcomes, 


Rate of return to investment in human capital 


0 


suitably valued, per year for each dollar invested 
in the child). The benefit-cost ratio (the ratio of 
the aggregate program benefits over the life of 
the child to the input costs) is over eight to one. 

Perry intervened relatively late. The Abecedar- 
ian program, also targeted toward disadvantaged 
children, started when participants were 4 months 
of age. Children in the treatment group received 
child care for 6 to 8 hours per day, 5 days per week, 
through kindergarten entry; nutritional supple- 
ments, social work services, and medical care 
were provided to control group families. The 
program was found to permanently raise the IQ 
and the noncognitive skills of the treatment group 
over the control group. However, the Abecedarian 
program was intensive, and it is not known 
whether it is the age of intervention or its inten- 
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Fig. 2. Rates of return to human capital investment in disadvantaged children. The 
declining figure plots the payout per year per dollar invested in human capital 
programs at different stages of the life cycle for the marginal participant at current 
levels of spending. The opportunity cost of funds (r) is the payout per year if the 
dollar is invested in financial assets (e.g., passbook savings) instead. An optimal 
investment program from the point of view of economic efficiency equates returns 
across all stages of the life cycle to the opportunity cost. The figure shows that, at 
current levels of funding, we overinvest in most schooling and post-schooling 
programs and underinvest in preschool programs for disadvantaged persons. 
Adapted from (3) with permission from MIT Press. 


sity that contributed to its success in raising IQ 
(15-17). 

Reynolds et al. present a comprehensive 
review of early childhood programs directed 
toward disadvantaged children and their impact 
(/8). Similar returns are obtained for other early 
intervention programs (J9, 20), although more 
speculation is involved in these calculations be- 
cause the program participants are in the early 
stages of their life cycles and do not have long 
earnings histories. 


Schools and Skill Gaps 

Many societies look to the schools to reduce skills 
gaps across socioeconomic groups. Because of the 
dynamics of human skill formation, the abilities 


Opportunity 
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and motivations that children bring to school play 
a far greater role in promoting their performance 
in school than do the traditional inputs that receive 
so much attention in public policy debates. The 
Coleman Report (2/) as well as recent work 
(22, 23) show that families and not schools are the 
major sources of inequality in student performance. 
By the third grade, gaps in test scores across socio- 
economic groups are stable by age, suggesting that 
later schooling and variations in schooling quality 
have little effect in reducing or widening the gaps 
that appear before students enter school (4, 24). 
Figure 1| plots gaps in math test scores by age across 
family income levels. The majority of the gap at age 
12 appears at the age of school enrollment. Carneiro 
and Heckman performed a cost-benefit analysis of 
classroom size reduction on adult earings (3). 
Although smaller classes raise the 
adult earnings of students, the earn- 
ings gains received by students do not 
offset the costs of hiring additional 
teachers. The student-teacher achieve- 
ment ratio (STAR) randomized trial 
of classroom size in Tennessee shows 
some effect of reduced classroom size 
on test scores and adult perform- 
ance, but most of the effect occurs in 
the earliest grades (25, 26). Schools 
and school quality at current levels 
of funding contribute little to the 
emergence of test score gaps among 
children or to the development of 
the gaps. 


Second Chance Programs 


America is a second chance society. 
Our educational policy is based on a 
fundamental optimism about the 
possibility of human change. The 
dynamics of human skill formation 
reveal that later compensation for de- 
ficient early family environments is 
very costly (4). If society waits too 
long to compensate, it is economical- 
ly inefficient to invest in the skills of 
the disadvantaged. A serious trade-off 
exists between equity and efficiency 
for adolescent and young adult skill policies. 
There is no such trade-off for policies targeted 
toward disadvantaged young children (28). 

The findings ofa large literature are captured in 
Fig. 2. This figure plots the rate of return, which is 
the dollar flow from a unit of investment at each 
age for a marginal investment in a disadvantaged 
young child at current levels of expenditure. The 
economic return from early interventions is high, 
and the return from later interventions is lower. 
Remedial programs in the adolescent and young 
adult years are much more costly in producing the 
same level of skill attainment in adulthood. Most 
are economically inefficient. This is reflected in 
Fig. 2 by the fact that a segment of the curve lies 
below the opportunity cost of funds (the horizon- 
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tal line fixed at 7). The opportunity cost is the 
return from funds if they were invested for 
purposes unrelated to disadvantaged children. 


Conclusions 


Investing in disadvantaged young children is a 
rare public policy initiative that promotes fairness 
and social justice and at the same time promotes 
productivity in the economy and in society at 
large. Early interventions targeted toward 
disadvantaged children have much higher returns 
than later interventions such as reduced pupil- 
teacher ratios, public job training, convict reha- 
bilitation programs, tuition subsidies, or expend- 
iture on police. At current levels of resources, 
society overinvests in remedial skill investments 
at later ages and underinvests in the early years. 

Although investments in older disadvantaged 
individuals realize relatively less return overall, such 
investments are still clearly beneficial. Indeed, the 


Table 1. Economic benefits and costs of the Perry 
Preschool Program (27). All values are discounted at 
3% and are in 2004 dollars. Earnings, Welfare, and 
Crime refer to monetized value of adult outcomes 
(higher earnings, savings in welfare, and reduced 
costs of crime). K-12 refers to the savings in reme- 
dial schooling. College/adult refers to tuition costs. 


Perry Preschool 


Child care $986 
Earnings $40,537 
K-12 $9184 
College/adult $-782 
Crime $94,065 
Welfare $355 
Abuse/neglect $0 
Total benefits $144,345 
Total costs $16,514 
Net present value $127,831 
Benefits-to-costs ratio 8.74 


advantages gained from effective early inter- 
ventions are sustained best when they are followed 
by continued high-quality learning experiences. The 
technology of skill formation shows that the returns 
on school investment and postschool investment are 
higher for persons with higher ability, where ability 
is formed in the early years. Stated simply, early 
investments must be followed by later investments 
if maximum value is to be realized. 


References and Notes 

1. E. |. Knudsen, J. J. Heckman, J. Cameron, J. P. Shonkoff, 
Proc. Natl. Acad. Sci. U.S.A., in press. 

2. J. P. Shonkoff, D. Phillips, From Neurons to Neighborhoods: 
The Science of Early Child Development (National 
Academies Press, Washington, DC, 2000). 

3. P. Carneiro, J. J. Heckman, in /nequality in America: What 
Role for Human Capital Policies? }. J. Heckman, A. B. 
Krueger, B. Friedman, Eds. (MIT Press, Cambridge, MA, 
2003), ch. 2, pp. 77-237. 

4. F. Cunha, J. J. Heckman, L. J. Lochner, D. V. Masterov, 
in Handbook of the Economics of Education, 

E. A. Hanushek, F. Welch, Eds. (North Holland, 
Amsterdam, in press). 

5. J.J. Heckman, D. V. Masterov, “The productivity argument for 
investing in young children,” (Working Paper No. 5, Committee 
on Economic Development, Washington, DC, 2004). 

6. J. B. Delong, L. Katz, C. Goldin, in Agenda for the Nation, 
H. Aaron, J. Lindsay, P. Nivola, Eds. (Brookings Institution 
Press, Washington, DC, 2003), pp. 17-60. 

7. D.T. Ellwood, in The Roaring Nineties: Can Full Employment 
Be Sustained? A. Krueger, R. Solow, Eds. (Russell Sage 
Foundation, New York, 2001), pp. 421-489. 

8. J. J. Heckman, P. LaFontaine, J. Lab. Econ., in press. 

9. International Adult Literacy Survey, 2002: User's Guide, 
Statistics Canada, Special Surveys Divison, National 
Literacy Secretariat, and Human Resources Development 
Canada (Statistics Canada, Ottawa, Ontario, 2002). 

10. D. A. Anderson, J. Law Econ. 42, 611 (1999). 

11. J. J. Heckman, J. Stixrud, S. Urzua, J. Lab. Econ., in press. 

12. Westinghouse Learning Corporation and Ohio University, 
The Impact of Head Start: An Evaluation of the Effects of 
Head Start on Children’s Cognitive and Affective 
Development, vols. 1 and 2 (Report to the Office of 
Economic Opportunity, Athens, OH, 1969). 

13. L. J. Schweinhart et al., Lifetime Effects: The High/Scope 
Perry Preschool Study Through Age 40 (High/Scope, 
Ypsilanti, Ml, 2005). 


19. 


20. 


21. 


22. 


28. 
29. 


. A. Rolnick, R. Grunewald, “Early childhood development: 
Economic development with a high public return” (Tech. 
rep., Federal Reserve Bank of Minneapolis, Minneapolis, 
MN, 2003). 

. C.T. Ramey, S. L. Ramey, Am. Psychol. 53, 109 (1998). 

. C.T. Ramey, S. L. Ramey, Prev. Med. 27, 224 (1998). 

. C.T. Ramey et al., Appl. Dev. Sci. 4, 2 (2000). 

. A. J. Reynolds, M. C. Wang, H. J. Walberg, Early 

Childhood Programs for a New Century (Child Welfare 

League of America Press, Washington, DC, 2003). 

L. A. Karoly et al., Investing in Our Children: What We 

Know and Don’t Know About the Costs and Benefits of Early 

Childhood Interventions (RAND, Santa Monica, CA, 1998). 

L. N. Masse, W. S. Barnett, A Benefit Cost Analysis of the 

Abecedarian Early Childhood Intervention (Rutgers 

University, National Institute for Early Education 

Research, New Brunswick, NJ, 2002). 

J. S. Coleman, Equality of Educational Opportunity (U.S. 

Deparment of Health, Education, and Welfare, Office of 

Education, Washington, DC, 1966). 

S. W. Raudenbush, “Schooling, statistics and poverty: 

Measuring school improvement and improving schools” 

Inaugural Lecture, Division of Social Sciences, University 

of Chicago, Chicago, IL, 22 February 2006. 

. J. J. Heckman, M. I. Larenas, $. Urzua, unpublished data. 

. D. A. Neal, in Handbook of Economics of Education, 

E. Hanushek, F. Welch, Eds. (Elsevier, Amsterdam, in 
press). 

. B. Krueger, D. M. Whitmore, Econ. J. 111, 1 (2001). 

. B. Krueger, D. M. Whitmore, in Bridging the Achievement 
Gap, }. E. Chubb, T. Loveless, Eds. (Brookings Institution 
Press, Washington, DC, 2002). 

. W. S. Barnett, Benefit-Cost Analysis of Preschool Education, 

2004, (http://nieer.org/resources/files/BarnettBenefits. ppt). 

F. Cunha, J. J. Heckman, J. Hum. Resour., in press. 

This paper was generously supported by NSF (grant nos. 

SES-0241858 and SES-0099195), National Institute of 

Child Health and Human Development (NIH grant no. 

RO1HD043411), funding from the Committee for Eco- 

nomic Development, with a grant from the Pew 

Charitable Trusts and from the Partnership for America’s 

Economic Success. This research was also supported by 

the Children’s Initiative project at the Pritzker Family 

Foundation and a grant from the Report to the Nation of 

America’s Promise. The views expressed in this paper are 

those of the author and not necessarily those of the 

sponsoring organizations. See our Web site (http:// 
jenni.uchicago.edu/econ_neurosci) for more information. 


10.1126/science.1128898 


PERSPECTIVE 


Studying Adolescence 


Linda M. Richter 


Young people in their teens constitute the largest age group in the world, in a special stage 
recognized across the globe as the link in the life cycle between childhood and adulthood. 
Longitudinal studies in both developed and developing countries and better measurements of 
adolescent behavior are producing new insights. The physical and psychosocial changes that occur 
during puberty make manifest generational and early-childhood risks to development, in the form 
of individual differences in aspects such as growth, educational attainment, self-esteem, peer 
influences, and closeness to family. They also anticipate threats to adult health and well-being. 
Multidisciplinary approaches, especially links between the biological and the social sciences, as well 
as studies of socioeconomic and cultural diversity and determinants of positive outcomes, are 
needed to advance knowledge about this stage of development. 


oung people aged 10 to 19 currently making up close to 20% of the 6.5 billion world 
constitute a demographic bulge. They population estimated in 2005 (7), 85% of whom 
are the largest age group in the world, live in developing countries and account for 


about one-third of those countries’ national pop- 
ulations. Adolescence has also been described as 
“demographically dense”: a period in life during 
which a large percentage of people experience a 
large percentage of key life-course events (2). 
These include leaving or completing school, 
bearing a child, and becoming economically 
productive. They also include experiences, more 
common in this age group than in others, that 
are capable of substantially altering life trajecto- 
ries: nonconsensual sex, alcohol and drug abuse, 
self-harm and interpersonal violence, and getting 
into trouble with the law. Diet and activity pat- 
terns, friendships, educational achievement, and 
civic involvement all affect current health, 
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schooling, and family life, but they also have 
long-term effects on well-being in adulthood 
and even on future generations. 

Adolescence was long thought to be an 
American cultural invention, a by-product of in- 
dustrialization; a personally and socially proble- 
matic period created by the dependence of young 
people on their parents after leaving school and 
waiting to find work (3). However, investiga- 
tions of hundreds of societies confirm that 
adolescence is a universally recognized life 
stage, starting around or just after puberty, al- 
though with different markers, behavioral man- 
ifestations, and social attributions (4-7). In most 
societies, the onset of adolescence is celebrated 
through rituals associated with prospective adult 
roles, such as reproduction, responsibility, and 
work; or with religious ceremonies, often dif- 
ferentiated by gender. In highly industrialized 
societies, the rites of passage are less public and 
more variable, but the period between childhood 
and adulthood may be bridged by changes in 
schooling, changes in family rules about auton- 
omy, “first-time” experiences such as drinking 
alcohol, or inauguration into a group (8, 9). 

After the publication in 1904 of a widely 
influential book by Hall (0), there was a per- 
vasively held opinion that the “raging hor- 
mones” of puberty inevitably led to rebellion, 
conflict with parents and other authorities, etc. 
This simplistic idea has been shown to be false 
(9). Most adolescents don’t go through a period 
of “sturm und drang.” Rather, the period of 
transition to adulthood—largely a socially des- 
ignated period, with an onset that is generally 
precipitated by apparent physical changes as- 
sociated with puberty—involves multiple in- 
teractions between biology and culture (or the 
set of social institutions and relationships prev- 
alent at the time). Unidirectional models that 
assume that hormones cause behavior (for 
example, that testosterone causes aggression) 
or that behaviors cause hormone change (for 
example, that stress increases cortisol levels) 
have given way to models of hormone/behavior 
interactions and theories that take context into 
account (J0). For example, poor attachment, 
family discord, and low investment in children 
are believed to affect the timing of puberty on- 
set (//). In turn, the combination of these stress- 
ors and early puberty contributes to conflict 
with parents (/2), lower self-esteem (/3), and 
associations with deviant peers (/4). Neuro- 
physiological and brain imaging studies have 
demonstrated brain reorganization during ado- 
lescence coincident with the onset of puberty, 
which may make adolescents more sensitive to 
experiences that affect their judgement (/5). 


New Designs, New Methods 

Time is an important dimension in understand- 
ing all developmental stages; at the individual 
level, in the interaction between individuals and 


changing sociocultural institutions and prac- 
tices, and in unfolding historical events. In 
1974, Glen Elder published Children of the 
Great Depression, a seminal work based on ar- 
chival data from the 1920s (/6). His analysis 
demonstrated how the growing independence of 
adolescent boys from their families cushioned 
the blow of economic shocks on their house- 
holds. Other longitudinal studies, such as the 
1946 British National Birth Cohort Follow-Up 
Study (17), the 1956 New York Longitudinal 
Study (78), and the National Longitudinal Study 
of Adolescent Health in the USA (www.cpc.unc. 
edu/addhealth) (2), have similarly shown the 
value of a life-course approach to understanding 
the development of young people. 


Fig. 1. Differences in growth and maturation 
between young women in Bt20, among young 
women living in the same city and born within 7 
weeks of one another in 1990. (The young women 
and their caregivers gave consent for the photo- 
graphs to appear in scholarly journals.) 


Costly as they are to maintain and complex 
as the data are to analyze, prospective longitu- 
dinal designs are ideally suited to studying hu- 
man development, especially for understanding 
processes of change and the temporal ordering 
of events that are used as one proxy of causality. 
Follow-up of young people across their life 
span, as they interact with family, school, peers, 
and the wider social world, enables understand- 
ing that goes much deeper than the snapshot 
provided by measurement at a single point in 
time. There are now several large-scale birth 
cohort studies in developing countries, including 
Brazil and South Africa, revealing information 
about long-term and generational effects of 
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nutrition and family life, considered against the 
backdrop of the demographic and health tran- 
sitions underway in these countries (/9, 20). 
The Birth to Twenty (Bt20) study in South 
Africa enrolled a cohort of more than 3000 
children in Soweto-Johannesburg in early 1990. 
Nicknamed ‘“‘Mandela’s Children,” this study 
has collected data from before birth to age 15 
(and the study is planned to continue to age 20) 
among the children (born immediately after 
Nelson Mandela’s release from prison) and 
among their families. The second generation, 
children of the cohort, have started to be born, 
with the youngest mother only 14 years old when 
she delivered her baby. This group of young peo- 
ple is the first generation of children to live in a 
democratic South Africa, and the study aims to 
portray the development of individuals and 
groups of individuals as they make their life tran- 
sitions across a particular historical period (2/). 


What Are We Learning? 


A small group of children can be distinguished 
in the first 2 years of life whose adjustment dif- 
ficulties, mainly in the form of problems re- 
lating to peers, persist for much of childhood. 
These problems are fairly well predicted by a 
combination of physiological (low birth weight), 
social (single parenthood), and economic (pov- 
erty) factors (22). Father absence is very high in 
southern Africa, largely because of migrant la- 
bor practices (23). Single-parent female-headed 
households are poorer than others; and men who 
are not married to the mother, either legally or 
traditionally, at the time of a child’s birth, give 
progressively less financial support and time to 
their offspring as they grow older (24). 

Marked differences occur between young 
people in all domains, most apparently in phys- 
ical growth, both in childhood and adolescence 
(Fig. 1). However, it is not only individual dif- 
ferences that can be studied across time but also 
the patterns produced by the clustering of per- 
sonal profiles, conditions, and contexts (25). 
Young people with different characteristics of 
physical growth or event onset such as puberty or 
sexual activity can be investigated with respect to 
the antecedents, consequences, and correlates of 
particular patterns. For example, children who 
show signs of rapid “catch-up” weight gain dur- 
ing infancy tend to have greater fat mass, poorer 
glucose tolerance, and increased risk of obesity, 
diabetes, and cardiovascular disease in later life 
(26). Boys and girls in Bt20 who were in a more 
advanced stage of puberty at 13 years of age 
were more likely to be engaging in a variety of 
activities, such as smoking, experimenting with 
drugs, and sexual activity, than were their less 
developed peers. 

Like other developmental stages, puberty has 
considerable individual variability. Pubertal stag- 
ing is influenced by a number of generational, 
social, and biological factors. The age at which 
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young people enter puberty has declined all over 
the world, largely as a result of improvements in 
socioeconomic conditions and nutrition (27). In 
South Africa, for example, menarche has de- 
creased by 0.73 years per decade among girls in 
urban environments, with the last reported mean 
age being 13.2 years (28), still a whole year later 
than among African-American girls in the 
United States (29). At the same time, pubertal 
timing and related physical and psychosocial 
factors are also strong determinants of risks for 
adult outcomes, including poor sexual and re- 
productive health (30), social problems (3/), and 
chronic diseases in later life (32). For example, 
early menarche is associated with initiation of 
sexual activity, both early and late puberty are 
coupled with changes in self-esteem among 
boys and girls, and weight gain among girls in 
puberty is related to later risk of hypertension 
and diabetes. 

Accurate measurement of pubertal staging in 
community-based studies in non-Western soci- 
eties, particularly Africa, has only recently 
become possible, with careful validation of 
self-assessment of hair growth and breast and 
gonad development against established criteria, 
such as Tanner’s Sexual Maturation Scale (33). 
Relating pubertal staging to risk behaviors also 
requires that the latter be accurately assessed. 
Questions about behaviors that are unlawful or 
socially sanctioned (such as sex, drug use, and 
truancy) have the greatest potential to be under- 
reported. New methods are becoming available 
to estimate these behaviors and correct these 
problems. These include the use of biological 
markers of behavior, such as the detection of 
salivary cotinine or thiocyanate to determine un- 
derreporting of smoking (34). Because cotinine, 
manufactured in the body, is a by-product of 
nicotine, cotinine measures are a good proxy for 
ingestion of or exposure to nicotine. Besides 
cotinine, we are also using urinary leukocyte 
esterase (ULE) tests in Bt20, in addition to the 
consistency of reports over time, as a screen for 
HIV risk and to estimate underreporting of sex- 
ual activity among young adolescents (35). ULE 
tests are more sensitive in females than males 
and require confirmatory microscopy. Further- 
more, urinary tract infections can be acquired in 
ways other than through sexual intercourse. 
Nonetheless, at age 13 years, positive ULE tests 
were found in twice the number of girls who 
reported having been sexually active and in an 
additional 50% of girls who reported having 
had sex at 14 and 15 years of age. Positive ULE 
tests were associated with subsequent informa- 
tion indicating the likelihood of early sexual 
activity in 16% of adolescents at 13 years of 
age and in 21% at 14 who reported that they 
were not sexually active. 

Also important are efforts to improve the 
accuracy of adolescent self-reports of potentially 
sensitive information. Recent technological de- 


velopments have given rise to audio computer- 
assisted self-interviewing (ACASI). Adolescents 
read questions on the screen or listen to them 
through headphones in a language of their 
choice and enter their responses directly through 
a standard or modified keyboard. Studies in 
Bt20, Zimbabwe, and Kenya indicate that re- 
spondents prefer the privacy afforded, and some 
young people rate themselves as being more 
honest in their replies. User problems with the 
technology are still challenges, awaiting easier- 
to-use options before its application among 
adolescents with low levels of education can 
be expanded (36). Other experimental tech- 
niques include the presentation of and response 
to sensitive questions through personal digital 
assistants and mobile phones. 

We are now entering a period where we can 
build on new capabilities for gathering data. 
Reliable and valid measurement, especially of 
data derived from young people’s self-reports, is 
the essential foundation for rigorous evaluation 
of programs to improve conditions for optimal 
adolescent development. For the next 5 years of 
the Bt20 program, we are concentrating on mea- 
sures of outcomes: achieving in school or drop- 
ping out, having an unwanted teenage pregnancy 
or completing school, weight gain, high diabetes 
indicators, conflict with the law, and so on. We 
are also starting to enroll the next generation— 
young Bt20 boys and girls are starting to have 
babies—and this gives us an excellent oppor- 
tunity to study intergenerational advantages and 
disadvantages. 

Bt20 and other longitudinal studies offer in- 
sights into predisposing conditions for beneficial 
and adverse outcomes in the adolescent years. 
Many of the findings of these studies emphasize 
the importance of early and systemic interven- 
tion. A good start in life, affectionate and stable 
family relationships, school and neighborhood 
support for young people’s development, and 
the like, all predict good outcomes for young 
people in terms of school achievement, adjust- 
ment, civic engagement, and future aspirations. 
However, there are still reasons for optimism 
even when children are exposed to extremely 
dysfunctional circumstances (37, 38). For exam- 
ple, three decades ago, Garmezy and his col- 
leagues found that although having a parent 
with schizophrenia did increase children’s risk 
for the illness, 90% of the chidren they studied 
had “good peer relations, academic achieve- 
ment, commitment to education and to pur- 
posive life goals, early and successful work 
histories” [(37), p. 114]. This is also true of 
conditions of poverty, conflict and violence, and 
parental substance abuse and criminality; most 
children exposed to these conditions grow up to 
lead successful lives as adults, with the capacity 
to love and work (38). Self-stabilizing tenden- 
cies enable many children and young people 
growing up in difficult circumstances to take 


advantage of even slender opportunities to par- 
ticipate in social activities with others, achieve at 
what they do and be valued, and contribute to the 
well-being of those they care about. Opportunity 
niches can be created by winning a race, being 
selected for a team or cast in a school play, or 
having a supportive family member even if not a 
parent, including a teacher who shows interest. 
All of these can change how a child sees him- or 
herself and how others see and treat him or her. 

We have been surprised by the extent to 
which this is true in the Bt20 cohort. Of the 
2300 children followed up to age 16, over 50% 
lived in very poor conditions (less than $1 per 
day per person), 20% frequently went to bed 
hungry during their early years, more than 40% 
had direct or vicarious experience of community 
or family violence, and only two out of five 
children had ever lived with their fathers (23). 
Despite these conditions, we currently estimate 
that only about 5% of the children showed 
persistent behavioral difficulties in their pre- 
school and early school years, had started smok- 
ing or carrying a weapon by age 14, or been in 
conflict with the law. However, as young people 
enter into their teen years, travel further away 
from home to school, are subject to less parental 
monitoring and supervision, and are increasingly 
exposed to peers who engage in risky behavior 
behavior, the rates of potentially problematic 
behaviors increase. For example, although only 
1.6% of 13-year-olds (3.3% of boys and 0.8% of 
girls) have had sexual intercourse, this rate rises 
to close to 20% at 15 years of age (27% among 
boys and 12% among girls). A composite risk 
score, combining rates of smoking, alcohol and 
drug use, foreplay, and weapon carrying, shows 
a significant increase with increased pubertal de- 
velopment and the transition from primary to 
secondary schooling. Young people most likely 
to be taking risks in their early teens are those 
who are advanced in their pubertal development 
for their age and in environments where they are 
exposed to older adolescents, without monitoring 
and supervision by caring adults. 


Conclusion 


The adolescent years, and especially puberty, 
link the impact of generational and early- 
childhood factors to adult outcomes. Longitu- 
dinal studies are demonstrating that it is also an 
age of opportunity: Good nutrition and healthy 
lifestyle, positive family and school influences, 
and access to supportive services, among other 
factors, can help young people break early pat- 
terns leading to ill health and poor social ad- 
justment, with benefits for adult well-being and 
the next generation of children and youth. New 
methods are increasing the accuracy and valid- 
ity of data collected from young people, and 
developments in both the biological and social 
sciences are providing unprecedented opportu- 
nities (39). However, we still know a lot more 
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about what goes wrong in adolescence and why, 
and a lot less about how to prevent problems and 
how to get young people back on track, es- 
pecially in those areas of the world in which 
young people face the greatest challenges. New 
knowledge is being driven by the need to 
develop and test interventions to promote the 
physical and psychological well-being of young 
people and counteract the risks associated with 
this developmental stage. This is especially true 
in a world in which many adolescents face the 
same threats—incomplete or poor-quality edu- 
cation, limited prospects for satisfying work, 
marginalization, HIV/AIDS and other sexually 
transmitted infections, substance abuse, violence, 
anxiety and depression—without the same op- 
portunities for help and support. 
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PERSPECTIVE 


Politics and the Life Cycle 


Donald R. Kinder 


The study of politics and the life cycle began with a rather single-minded focus on childhood 
and the family—on the idea, as Tocqueville famously put it, that the entire person could be 
“seen in the cradle of the child.” Politics does begin in childhood, and parents do influence their 
offspring, but change takes place over the entire span of life. | take up the early emergence of 
partisanship and essentialism, the formation of generations, politically consequential transitions 
in adulthood, and the rising of politics and its final decline. 


y assignment is to highlight important 
Mine and promising developments 

in the study of politics over the life 
cycle: politics, one could say, from the cradle to 
the grave. I focus on exemplary cases, confine 
my attention to the United States, overlook many 
local skirmishes, and concentrate primarily on 
tracing the life cycle path of political belief 
(rather than action). The essay proceeds chron- 
ologically, beginning with childhood. 


Partisanship and Essentialism in Childhood 


What Freud did to assumptions of childhood 
sexual innocence, Greenstein (/) and other pio- 
neers in the field did to assumptions of political 
innocence. Children may be naive and poorly 
informed when it comes to politics, but they are 
far from innocent. They express strong attach- 
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ment to the nation. They think of themselves, 
proudly, as partisans of one party or the other. 
They believe that their country and its way of 
life are best. They happily subscribe to conven- 
tional stereotypes about blacks and whites, men 
and women, and rich and poor (2). 

As to the origins of such beliefs, speculation 
centered initially on the family, on the claim from 
social learning theory that children would imitate 
and eventually internalize what their parents said 
and did (3). The best evidence comes from the 
landmark study carried out by Jennings and 
Niemi in 1965, based on independent interviews 
with a national sample of high school seniors 
and their parents. Jennings and Niemi dis- 
covered that adolescent children did indeed 
seem to copy the party identification of their 
parents (when they could discern it). On most 
matters of belief, however, correspondence be- 
tween parents and offspring was unimpressive: 
moderate on attitudes toward social groups and 


close to negligible on matters of policy. Weak 
correspondence presumably reflects a failure to 
communicate: confusion or conflict among par- 
ents, the evasion of political subjects in family 
discussions, and (perhaps especially) indiffer- 
ence to politics among the children (4). 

An alternative model for parental influence is 
genetic transmission. Until quite recently, the 
assumption that political beliefs are acquired 
through experience has been taken as an article 
of faith. Rapid developments in human behavioral 
genetics have made this stance increasingly difficult 
to maintain. A number of studies have compared 
the political views expressed by monozygotic 
twins (who share an identical genetic inheritance) 
to the views expressed by dizygotic twins (who 
develop from two separate eggs fertilized by two 
separate sperm). The comparisons take into ac- 
count whether twins are reared together or apart, 
and whether they are raised by their biological 
parents or adopted into new families. The results 
suggest that adult political beliefs—on the death 
penalty, say, or on school prayer—have a sizable 
genetic component. On matters of politics, par- 
ents may influence their biological offspring as 
much through the “genetic blueprint” they pro- 
vide at conception as through the modeling and 
instruction they supply later on (5—7). 

Children are not only partisans; they are also, 
according to a recent line of research, essentialists. 
That is, children seem to believe that certain social 
categories are “natural kinds”: real (not constructed) 
and discovered (not invented). More specifically, 
they believe that race and sex and ethnicity 
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belong entirely to the natural world and that 
differences between, say, blacks and whites are 
rooted entirely in biology, or blood, or some such 
underlying essence. Essence “explains” inner 
qualities—temperament, intellect, character—as 
well as outward, physical ones. Children come to 
believe all this, moreover, without instruction. 
Essentialism is thought to be the product of 
evolutionary adaptation: The human cognitive 
system is predisposed to treat social groupings as 
natural kinds (8, 9). This line of work is relevant 
here because it suggests a deep foundation for 
social stereotyping, and social stereotyping is 
in turn a pervasive feature of adult political 
belief. Essentialism conceived of in this way 
clarifies why so much of public opinion is 
“‘group-centric’’: why views on policy, attach- 
ments to party, and votes for candidates depend 
so decisively on the beliefs and feelings people 
harbor toward prominent social groupings (/0). 


Political Generations 


Political generations are created out of the con- 
junction of individual development and political 
history. The formation of a distinctive genera- 
tional perspective requires both the openness of 
late adolescence and early adulthood and the 
intrusion of events such as war, depression, and 
social disorder. Under these circumstances, a 
generation is expected to enter political life with 
a distinctive and largely permanent commit- 
ment to a certain point of view—in possession, 
Mannheim would say, of its own particular 
“historical-social consciousness”’ (//). 

But as a raw empirical matter, instances of 
historical-social consciousness are not that easy 
to find. This is in large part because compre- 
hensive worldviews are exceedingly uncom- 
mon among ordinary citizens, whatever their 
generation (/2, 13). If we look for less majestic 
empirical outcroppings due to generation, how- 
ever, there is good evidence to be found. People 
command more vivid memories and deeper 
knowledge for events that take place during 
their late adolescence and early adulthood. The 
Great Depression, World War II, the civil rights 
movement, the Vietnam War: these events 
loom especially large in the memory of those 
Americans who were just coming of age at the 
time. People know more about such events. 
They invest them with greater importance. And 
perhaps most important, they are predisposed to 
apply the lessons of their generation’s experi- 
ence to contemporary matters (/4, /5). 

Generations are also centrally implicated in 
partisan change. Under normal circumstances, par- 
ty identification is a “durable attachment, not read- 
ily disturbed by passing events and personalities” 
[(16), p. 151; (17, 18)]. But periods of serene sta- 
bility are occasionally interrupted. The exemplary 
case of party realignment in the United States was 
set in motion by the economic calamity that over- 
whelmed the country during the administration of 


Herbert Hoover. In 1932, at first opportunity, 
Hoover was driven out of office; shortly thereafter, 
the Republican Party lost control of Congress; by 
the end of the Depression, the Democrats had be- 
come the majority party. This transformation was 
accomplished in large part by Franklin Roosevelt 
and the Democratic Party capturing, more or less 
permanently, the loyalty of those Americans who 
were about to enter political life in the 1930s. 
Older generations born before 1905 were predom- 
inantly Republican and remained so, whereas 
younger generations born after 1916 became, 
and by and large stayed, Democrats (/6, /9). 


Transitions 


A generational analysis implies that political 
development takes place primarily in the im- 
pressionable years of late adolescence and 
early adulthood, and there is some truth to this 
(8, 20, 21). But if the pace of political devel- 
opment slows in adulthood, it does not halt. 

Passage through adulthood can be analyzed 
as a series of transitions into (and out of) roles: 
soldier, parent, neighbor, and more. To some 
degree, these new roles and the social spaces 
they occupy—partly individually chosen, partly 
environmentally supplied—serve as “‘socializa- 
tion depots.”’ Faced with new norms and fresh 
ideas, people change. 

An excellent illustration of research along 
these lines is Huckfeldt and Sprague’s (22) 
analysis of the “weak ties” that run through 
neighborhoods (23). In theory, neighborhoods 
are politically consequential because “they 
determine proximity and exposure—they serve 
to structure important elements of involuntary 
social interaction” [(22), p. 36]. In a detailed 


Fig. 1. Comstock Images/Corbis 


empirical investigation of an election campaign, 
Huckfeldt and Sprague show how neighbor- 
hoods influence individuals’ political beliefs 
through recurrent processes of social influence. 
Another way to conceive of passage through 
the adult years draws on the idea of self- 
interest—the seemingly straightforward claim 
that citizens are predisposed to support parties 
and policies that advance their own material in- 
terests. From this perspective, life cycle transitions 
become important politically insofar as they gen- 
erate new incentives and distinctive interests. As 
people step into new roles—by purchasing a 
home, enlisting in the armed services, or having 
children—political beliefs should change accord- 
ingly. This seems plausible, but self-interest turns 
out to be a surprisingly unimportant source of 
political belief (24). Under just the right circum- 
stances—when the material benefits or harms of 
a proposed policy are substantial, imminent, and 
well-advertised—self-interest can make a differ- 
ence, as in the lavishly financed and hotly con- 
tested campaign for property tax reform in 
California in 1978 (25). But because these cir- 
cumstances don’t come along very often, self- 
interest contributes little to an understanding of 
how political beliefs change over the life cycle. 


Aging 
Do individuals grow more conservative as they 
age? No. Aging is generally unaccompanied by 
movement to the right (or to the left)—not on 
particular matters of economics or foreign 
policy and not on partisanship, either (26, 27). 
Intensity of partisanship is another matter, 
however. Identification with a political party 
tends to strengthen over the life cycle, evidently 
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as a consequence of the “sheer temporal accu- 
mulation of electoral experience” [(28), p. 150; 
(/8)]. Electoral experience and age are not iden- 
tical; the former can be affected by changes in 
laws and procedures governing who is eligible 
to vote, the creation or suspension of elections 
(as in the fascist interlude in 20th-century 
Germany and Italy), and other external intru- 
sions. The strengthening of partisanship with 
experience may seem a humble result, but it is 
not. Strong partisan attachments are vital to the 
preservation of democratic forms of govern- 
ment, and at the dawn of the 21st century, the 
world is flush with new and fragile democracies. 


Political Action over the Life Cycle 


Belief captures only part of the story of politics 
and the life cycle. What can be said (if tel- 
egraphically) about change and continuity over 
the life cycle of political action (29)? 

Participation in politics in the contemporary 
United States is characterized by huge and per- 
sistent inequalities. Those with more income, 
more skills, and especially more education are 
much more likely to take part in politics across 
virtually all forms of participation (30). The roots 
of such inequality are to be found, in part, in the 
family. Well-educated parents are likely to take 
part in politics and to create homes in which 
there is lively political discussion. Children 
growing up in such environments tend to acquire 
a set of political predispositions—interest, knowl- 
edge, efficacy—that motivate participation in 
politics later on. Well-educated parents are also 
likely to have well-educated children, and 
educational attainment is the single most potent 
predictor of participation in adulthood. Partici- 
pation begins in the family, and so too does 
political inequality (31). 

Over the past three decades, as inequality 
has increased, participation in civic and politi- 
cal life has generally declined. It turns out that 
much of this disengagement from public life is 
generational. Americans who came of age at 
the time of the New Deal or World War II en- 
tered political life at moments of immense con- 
sequence and great common purpose. This 
particular conjunction of individual develop- 
ment and extraordinary political history appears 
to have indelibly marked not just memory and 
belief, as we saw earlier, but also participation 
in the collective enterprises of politics (32). 

Transitions play a prominent role in political 
action as well. Several mechanisms seem to be 
at work here, but all share the same basic 
premise: Participation in politics is costly, and 
potential participants count costs. Participation 
eats up time and sometimes money; it requires 
a variety of skills; it entails foregoing other op- 
portunities (33). 

From this perspective, we would expect that 
transitions in and out of new roles would de- 
press participation, at least in the short run, and 


they do. Being laid off, getting married or di- 
vorced, having children, or changing residence 
all result in diminished participation, as time 
and energy are directed to more pressing per- 
sonal matters (30, 34). 

In the longer run, adult roles are important 
for political action insofar as they “teach’’ skills 
that reduce the costs of participation. Those who 
speak and write well, or who command the in- 
terpersonal arts required for organizing others, 
are more likely as a consequence to take an ac- 
tive part in politics, and such skills are often 
developed and refined through involvement in 
the institutions that constitute civil society: work, 
neighborhood associations, and religious orga- 
nizations (35). 

Transitions are also important from this per- 
spective, as they move people into and out of 
spaces that are sites for political mobilization. To 
enhance their chances of winning an election or 
passing a bill, public officials and political organi- 
zations often use various mobilization tactics. 
They sponsor meetings and rallies, circulate pe- 
titions, request contributions, supply citizens with 
arguments (and even text) with which to bom- 
bard their representatives, and more. That is, 
officials and organizations subsidize the costs of 
participation that people would otherwise have to 
put up entirely on their own. When people 
inhabit roles that make them likely targets of 
mobilization, they are more likely to take part in 
politics (30). 

This framework implies that self-interest 
might be more important to action than to be- 
lief, and this seems to be true as well. When 
predicting who takes part in politics and who 
does not, it is useful to know whose interests 
are directly and immediately at stake (36, 37). 
Self-interest explains not so much where people 
stand, but whether they act. 

Finally, we know that participation in pol- 
itics typically begins at a relatively low level in 
early adulthood, rises steadily through middle 
age as psychological participation in the wider 
world broadens, and then declines toward the 
end of life, as “infirmity defeats experience”’ 
[(30), p. 141]. 


Unfinished Business 


The study of politics and the life cycle began 
with a single-minded focus on childhood and 
the family. Politics does begin in childhood, and 
parents do influence their offspring, through 
instruction and genetic endowment both, but 
change, we now know, takes place over the en- 
tire span of life (38). Such change seems to 
follow two rather distinct paths: one for political 
belief and another for political action. One 
challenge for future research is to attend more 
systematically to the difference between belief 
and action, and to offer, in the end, an under- 
standing of politics and the life cycle from the 
perspective of the whole person. 
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PERSPECTIVE 


Would You Be Happier If You Were 
Richer? A Focusing Illusion 


Daniel Kahneman,* Alan B. Krueger,’2* David Schkade,? Norbert Schwarz,’ Arthur A. Stone? 


The belief that high income is associated with good mood is widespread but mostly illusory. People 
with above-average income are relatively satisfied with their lives but are barely happier than 
others in moment-to-moment experience, tend to be more tense, and do not spend more time in 
particularly enjoyable activities. Moreover, the effect of income on life satisfaction seems to be 
transient. We argue that people exaggerate the contribution of income to happiness because they 
focus, in part, on conventional achievements when evaluating their life or the lives of others. 


ost people believe that they would be 
Mer if they were richer, but survey 
evidence on subjective well-being is 
largely inconsistent with that belief. Subjective 
well-being is most commonly measured by ask- 
ing people, “All things considered, how satis- 
fied are you with your life as a whole these 
days?” or “Taken all together, would you say 
that you are very happy, pretty happy, or not too 
happy?” Such questions elicit a global evalua- 
tion of one’s life. An alternative method asks 
people to report their feelings in real time, 
which yields a measure of experienced affect 
or happiness. Surveys in many countries con- 
ducted over decades indicate that, on average, 
reported global judgments of life satisfaction 
or happiness have not changed much over the 
last four decades, in spite of large increases in 
real income per capita. Although reported life 
satisfaction and household income are posi- 
tively correlated in a cross section of people at a 
given time, increases in income have been found 
to have mainly a transitory effect on individuals’ 
reported life satisfaction (/—3). Moreover, the 
correlation between income and subjective well- 
being is weaker when a measure of experienced 
happiness is used instead of a global measure. 
When people consider the impact of any 
single factor on their well-being—not only 
income—they are prone to exaggerate its im- 
portance. We refer to this tendency as the fo- 
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cusing illusion. Standard survey questions on life 
satisfaction by which subjective well-being is 
measured may induce a form of focusing illusion, 
by drawing people’s attention to their relative 
standing in the distribution of material well-being 
and other circumstances. More importantly, the 
focusing illusion may be a source of error in 
significant decisions that people make (4). 
Evidence for the focusing illusion comes 
from diverse lines of research. For example, 
Strack and colleagues (5) reported an experiment 
in which students were asked: (i) “How happy 
are you with your life in general?” and (ii) 
“How many dates did you have last month?” 
The correlation between the answers to these 


questions was —0.012 (not statistically different 
from 0) when they were asked in the preceding 
order, but the correlation rose to 0.66 when the 
order was reversed with another sample of 
students. The dating question evidently caused 
that aspect of life to become salient and its 
importance to be exaggerated when the respon- 
dents encountered the more general question 
about their happiness. Similar focusing effects 
were observed when attention was first called to 
respondents’ marriage (6) or health (7). One 
conclusion from this research is that people do 
not know how happy or satisfied they are with 
their life in the way they know their height or 
telephone number. The answers to global life 
satisfaction questions are constructed only when 
asked (8), and are, therefore, susceptible to the 
focusing of attention on different aspects of life. 

To test the focusing illusion regarding income, 
we asked a sample of working women to estimate 
the percentage of time that they had spent in a bad 
mood in the preceding day. Respondents were 
also asked to predict the percentage of time that 
people with pairs of various life circumstances 
(Table 1), such as high- and low-income, typi- 
cally spend in a bad mood. Predictions were 
compared with the actual reports of mood pro- 
vided by respondents who met the relevant cir- 
cumstances. The predictions were biased in two 
respects. First, the prevalence of bad mood was 


Table 1. The focusing illusion: Exaggerating the effect of various circumstances on well-being. The 
question posed was “Now we would like to know overall how you felt and what your mood was like 
yesterday. Thinking only about yesterday, what percentage of the time were you: in a bad 
mood___%, a little low or irritable___—'%, in a mildly pleasant mood___%, in a very good 
mood___%." Bad mood reported here is the sum of the first two response categories. A parallel 
question was then asked about yesterday at work. Bad mood at work was used for the supervision 
and fringe benefits comparisons. Data are from (14). Reading down the Actual column, sample sizes 
are 64, 59, 75, 237, 96, 211, 82, 221, respectively; reading down the Predicted column, sample sizes 
are 83, 83, 84, 84, 83, 85, 85, 87, respectively. Predicted difference was significantly larger than actual 
difference by a f test; see asterisks. 


Percentage of time in a bad mood 


ene anne Actual _— Predicted one riemstee 

difference difference 

Household income <$20,000 32.0 57.7 12.2 32.0*** 
>$100,000 19.8 25.7 

Woman over 40 years old Alone 21.4 41.1 -1.7 13.2*** 
Married 23.1 27.9 

Supervision at work Definitely close 36.5 64.3 17.4 42.1*** 
Definitely not close 19.1 22.3 

Fringe benefits No health insurance 26.6 49.7 4.5 30.5*** 
Excellent benefits 22.2 19.2 


***P < 0.001. 
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generally overestimated. Second, consistent with 
the focusing illusion, the predicted prevalence of 
a bad mood for people with undesirable circum- 
stances was grossly exaggerated. 

The focusing illusion helps explain why the 
results of well-being research are often counter- 
intuitive. The false intuitions likely arise from a 
failure to recognize that people do not continu- 
ously think about their circumstances, whether 
positive or negative. Schkade and Kahneman (9) 
noted that, “Nothing in life is quite as important 
as you think it is while you are thinking about 
it.” Individuals who have recently experienced a 
significant life change (e.g., becoming disabled, 
winning a lottery, or getting married) surely think 
of their new circumstances many times each day, 
but the allocation of attention eventually changes, 
so that they spend most of their time attending to 
and drawing pleasure or displeasure from expe- 
riences such as having breakfast or watching 
television (/0). However, they are likely to be 
reminded of their status when prompted to answer 
a global judgment question such as, “How 
satisfied are you with your life these days?” 

The correlation between household income 
and reported general life satisfaction on a numeric 
scale (i.e., global happiness as distinct from 
experienced happiness over time) in U.S. samples 
typically ranges from 0.15 to 0.30 (//). The 
relation between global happiness and income 
for 2004 with data from the General Social 


Survey (GSS) is illustrated in Table 2. Those 
with incomes over $90,000 were nearly twice as 
likely to report being “very happy” as those with 
incomes below $20,000, although there is hardly 
any difference between the highest income group 
and those in the $50,000 to $89,999 bracket. 
There are reasons to believe that the cor- 
relation between income and judgments of life 
satisfaction overstates the effect of income on 
subjective well-being. First, increases in income 
have mostly a transitory effect on individuals’ 
reported life satisfaction (2, 12). Second, large 
increases in income for a given country over 
time are not associated with increases in aver- 
age subjective well-being. Easterlin (/), for 
example, found that the fivefold increase in real 
income in Japan between 1958 and 1987 did 
not coincide with an increase in the average 
self-reported happiness level there. Third, al- 
though average life satisfaction in countries 
tends to rise with gross domestic product 
(GDP) per capita at low levels of income, there 
is little or no further increase in life satisfaction 
once GDP per capita exceeds $12,000 (3). 
Fourth, when subjective well-being is mea- 
sured from moment to moment—either by 
querying people in real time with the Ecolog- 
ical Momentary Assessment (EMA) technique 
(73) or by asking them to recall their feelings 
for each episode of the previous day with the 
Day Reconstruction Method (DRM) (/4)— 


Table 2. Distribution of self-reported global happiness by family income, 2004. The GSS question 
posed was “Taken all together, how would you say things are these days—would you say that you 
are very happy, pretty happy, or not too happy?” Sample size was 1173 individuals. 


Percentage indicating global happiness at family income of 


$50,000-$89,999 $90,000 and over 


Response 
Under $20,000 $20,000-$49,999 
Not too happy 17.2 13.0 
Pretty happy 60.5 56.8 
Very happy 22.2 30.2 


7.7 5.3 
50.3 51.8 
41.9 42.9 


Table 3. Correlations between selected life circumstances and subjective well-being measures. The 
question posed was “We would like to know how you feel and what mood you are in when you are 
at home. When you are at home, what percentage of the time are you in a bad mood___—*%, a 
little low or irritable____%o, in a mildly pleasant mood___—%, in a very good mood___—%." The 
last two response categories were added together to obtain the percentage of time in a good mood. 
Duration-weighted “happy” is the average of each person's duration-weighted average rating of 
the feeling happy over episodes of the day, where 0 refers to “not at all” and 6 refers to “very 
much,” and each individual’s responses were weighted by the duration of the episode. Sample 
consists of 740 women from Columbus, Ohio, who completed the DRM in May 2005 (16). 


er Life Amount of day . é ‘i ‘i 
Characteristic satiation in good mood (%) Duration- weighted “happy 
Household income 0.32*** 0.20*** 0.06 
Married 0.21*** 0.15*** 0.03 
Years of education 0.16*** 0.13*** 0.03 
Employed 0.14*** 0.12** 0.01 
Body mass index —-0.13*** —0.08* —0.06 
*P < 0.05; *P < 0.01; ***P < 0.001. 
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income is more weakly correlated with experi- 
enced feelings such as momentary happiness 
averaged over the course of the day (henceforth 
called duration-weighted or experienced happi- 
ness) than it is with a global judgment of life 
satisfaction or overall happiness, or with a global 
report of yesterday’s mood (Table 3) (15, /6). 
This pattern is probably not a result of greater 
noise in the duration-weighted happiness mea- 
sure than in life satisfaction (77). Other life cir- 
cumstances, such as marital status, also exhibit 
a weaker correlation with duration-weighted 
happiness than with global life satisfaction. 

An analysis of EMA data also points to a 
weak and sometimes perverse relation between 
experienced affect and income. Specifically, we 
examined EMA data from the Cornell Work- 
Site Blood Pressure Study of 374 workers at 10 
work sites, who were queried about their inten- 
sity of various feelings on a 0 to 3 scale every 25 
min or so during an entire workday (/8). The 
correlation between personal income and the 
average happiness rating during the day was 
just 0.01 (P = 0.84), whereas family income 
was significantly positively correlated with 
ratings of angry/hostile (r = 0.14), anxious/tense 
(r = 0.14), and excited (r = 0.18). Thus, higher 
income was associated with more intense nega- 
tive experienced emotions and greater arousal, 
but not greater experienced happiness. 

Why does income have such a weak effect 
on subjective well-being? There are several ex- 
planations, all of which may contribute to vary- 
ing degrees. First, Duesenberry (/9), Easterlin 
(2), Frank (20), and others have argued that 
relative income rather than the level of income 
affects well-being—earning more or less than 
others looms larger than how much one earns. 
Indeed, much evidence indicates that rank within 
the income distribution influences life satisfac- 
tion (2/—23). As society grows richer, average 
rank does not change, so the relative income 
hypothesis could explain the stability of av- 
erage subjective well-being despite national 
income growth. The importance placed on rel- 
ative income may also account for the stronger 
correlation between income and global life 
satisfaction than between income and experi- 
enced affect, as life satisfaction questions prob- 
ably evoke a reflection on relative status that is 
not present in moment-to-moment ratings of 
affect. The relative income hypothesis cannot by 
itself explain why a permanent increase in an 
individual’s income has a transitory effect on her 
well-being, as relative standing would increase. 
However, the increase in relative standing can be 
offset by changes in the reference group: After a 
promotion, the new peers increasingly serve as a 
reference point, making the improvement rela- 
tive to one’s previous peers less influential (24). 

Second, Easterlin (/, 2) argues that individuals 
adapt to material goods, and Scitovsky (25) 
argues that material goods yield little joy for 
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Table 4. How is time spent and do the activities bring happiness? Time 
allocation is weighted-average percentage of the nonsleep day for each 
sampled observation from the American Time-Use Survey (30). Weighted 
average of weekday (5 out of 7) and weekend (2 out of 7) is presented. 


Sample consists of 3917 men and 4944 women age 18 to 60. Last two rows 
were computed by authors from a DRM survey of 810 women in Columbus, 
Ohio, in May 2005; if multiple activities were performed during an episode, 
the activity refers to the one that was selected as “most important.” 


Family income/Gender Active leisure Eating Passive leisure Compulsory Work and commute Other 
Men Time allocation (%) 
<$20,000 6.6 6.6 34.7 20.8 29.1 2.1 
$20,000-$99,999 8.1 2 26.4 21.8 35.4 1.1 
$100,000+ 10.2 8.6 19.9 23.6 36.9 0.8 
Women 
<$20,000 5.3 5.7 33.5 35.6 18.5 1.4 
$20,000-$99,999 7.5 6.7 23.8 34.3 26.7 1.0 
$100,000+ 9.1 7.0 19.6 35.9 27.3 11. 
Women Feelings (0-6 scale) 
Happy 4.67 4.45 4.21 4.04 3.94 4.25 
Tense/Stressed 0.92 1.17 1.30 1.80 2.00 1.61 


most individuals. Thus, increases in income, 
which are expected to raise well-being by raising 
consumption opportunities, may in fact have little 
lasting effect because of hedonic adaptation or 
because the consumption of material goods has 
little effect on well-being above a certain level of 
consumption (26). Moreover, people’s aspirations 
adapt to their possibilities and the income that 
people say they need to get along rises with in- 
come, both in a cross section and over time (27). 

Finally, we would propose another explana- 
tion: As income rises, people’s time use does 


happiness, although it could increase tension and 
one’s sense of accomplishment and satisfaction. 

Despite the weak relation between income and 
global life satisfaction or experienced happiness, 
many people are highly motivated to increase 
their income. In some cases, this focusing illusion 
may lead to a misallocation of time, from ac- 
cepting lengthy commutes (which are among the 
worst moments of the day) to sacrificing time 
spent socializing (which are among the best 
moments of the day) (28, 29). An emphasis on 
the role of attention helps to explain both why 


satisfaction and with disaggregated measures of emo- 
tional experience at particular times. 

16. D. Kahneman, D. Schkade, C. Fischler, A. Krueger, A. Krilla, 
“A study of well-being in two cities,” Discussion Paper No. 53, 
Center for Health and Wellbeing, Princeton, NJ, 2006. 

17. We conducted a reliability study of the DRM that asked 
the same questions of 229 women two weeks apart, and 
found about the same two-week serial correlation in 
duration-weighted happiness as in life satisfaction for 
the respondents. 

18. P. Schnall, J. Schwartz, P. Landsbergis, K. Warren, 

T. Pickering, Psychosom. Med. 60, 697 (1998). 

19. J. Duesenberry, /ncome, Saving, and the Theory of 

Consumer Behavior (Harvard Univ. Press, Cambridge, MA, 


1949). 
not appear to shift toward activities that are many people seek high income—because their 20. R. Frank, Luxury Fever (Princeton Univ. Press, Princeton, 
associated with improved affect. Subjective predictions exaggerate the increase in happiness NJ, 1999). 


well-being is connected to how people spend 
their time. In a representative, nationwide sam- 
ple, people with greater income tend to devote 
relatively more of their time to work, compul- 
sory nonwork activities (such as shopping and 
childcare), and active leisure (such as exercise) 
and less of their time to passive leisure activities 
(such as watching TV) (Table 4). The activities 
that higher-income individuals spend relatively 


due to the focusing illusion—and why the long- 
term effect of income gains become relatively 
small, because attention eventually shifts to less 
novel aspects of daily life. 
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PERSPECTIVE 
Redistributing Work in Aging Europe 
James W. Vaupel and Elke Loichinger 


As Europe ages, the proportion of people who work will decline unless older individuals remain in 
the labor force. Such reform could be part of a more general redistribution of work. If a greater 
share of the population worked, then the average number of hours worked per week could be 
reduced. This could particularly help younger people and increase Europe’s low birth rates. The 
challenges facing Germany, Europe's most populous country, are highlighted, but statistics are also 
given for five other European countries and, for comparison, the United States. Social science 
research is needed to provide policy-relevant knowledge about life-course options. 


urope, the oldest continent, is growing 
BK Low birth rates (/, 2), rising life ex- 

pectancy (/, 3—5), and, to a lesser extent, 
migration flows (/) are reshaping the “pyramids” 
that describe population composition by age and 
sex (Fig. 1). The new demography poses chal- 
lenges to current labor practices and policies and 
offers individuals opportunities for greater life- 
course choice. We summarize the social science 
research needed to help societies meet the chal- 
lenges and to help individuals take advantage 
of the opportunities. We highlight Germany, Eu- 
rope’s most populous country, but also present 
data on five other members of the European 
Union and, for comparison, the United States. 


Two Indicators of Demographic Change 


Traditionally, the burden of old-age dependency 
has been summarized by the ratio of the number 
of people above 60 to the number of people 
between 20 and 60. We introduce two “‘Rostock 
indicators” that we believe are more informa- 
tive for summarizing the magnitude of the 


economic and social challenges caused by 
population aging (Table 1). The first is based 
on a simple measure of labor force participation, 
namely the ratio of nonworkers to workers, with 
workers including everyone who works for 
remuneration for at least 1 hour per week (6). 
In 2005 in Germany, this dependency ratio was 
1.27: There were five people who were not 
working for every four people who were (7). To 
determine the impact of demographic change, 
we can calculate the ratio keeping labor-force 
participation rates by age and sex at 2005 levels 
but using the population pyramid for 2025. This 
dependency ratio for Germany would then be 
1.47, with nearly three nonworkers for every 
two workers. Other European countries show 
essentially the same picture, but in the United 
States it improves. The United States faces a 
less daunting demographic future because wom- 
en (and men) in the United States are having 
about two children on average (compared with 
14 to 1’4 in much of Europe) because of siz- 
able immigration flows of young workers, and 
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because life expectancy has been relatively low 
and increasing relatively slowly. 

Our second indicator of demographic change 
is based on the number of hours worked per 
week per capita. Germans in 2005 worked an 
average of 16.3 hours per week (7). This value 
is so low because only 44% of Germans worked 
at all. Demographic change from 2005 to 2025 
will result in an 8% decrease. In France, Italy, 
and the Netherlands, population aging will re- 
duce the hours worked per week per capita 
by about 10%. To a rough first approximation, 
their economies will be smaller by 10% than 
they otherwise would have been. If productivity 
gains are large enough, Europeans may enjoy a 
somewhat higher standard of living 20 years 
from now even though they are working less. 
The distribution of work, however, will be even 
more unequal than it is today. People will be 
working less on average because more people 
will not be working at all. 


Working at Older Ages 


To keep dependency ratios and hours worked 
per week per capita at current levels, it is nec- 
essary for age-specific patterns of work to 
change. Consider Germany. The hours worked 
per week per capita in Germany in 2005 can be 
broken down by age (blue line in Fig. 2). If 
average effort is to be maintained at its current 
value of 16.3 hours per week, one option would 
be to increase work by people in their 50s and 
early 60s (red line in Fig. 2). Not everyone at 
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Fig. 1. Population pyramids for Germany in 1910, 2005, and 2025 (7). The data for 1910 do resemble a pyramid, with many children and few elderly 
people, but by 2005 there was a bulge of adults around age 40. This bulge will rise to age 60 in 2025. 
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these ages will be healthy enough to work, but 
a key finding of recent social science research 
is that as people live longer they tend to have a 
longer span of health (8). Social science re- 
search has also deepened our understanding of 
the relationship between health and retirement 
and between health and wealth (9). Further- 
more, the incentives that drive employee and em- 
ployer decisions about retirement age are now 
well understood (/0, 11). A knowledge base of 
demographic and economic theory and evidence 
exists to inform policymakers and the public 
about broad needs and options to increase em- 
ployment at older ages (12, /3). 

As the proportion of voters who are older 
than 50 grows, it may become more difficult to 
increase the age of retirement. As costs of sup- 
porting the elderly rise, expenditures on every- 
thing else, including research, education, and 
child care, may be reduced. This dismal pros- 
pect has received much press, but there is lit- 
tle evidence to either support or refute it (74, 75). 
In the United States and several European coun- 
tries, intelligent discussion of policy alternatives 
has created, to varying degrees, a climate of 
public opinion that recognizes, reluctantly, the 
need for an increase in the typical age of re- 
tirement. In contrast, in France and Italy, public 
discourse about retirement age (and other 
economic reforms) is woefully deficient. So- 
cial scientists could play a constructive role by 
participating more actively in public discus- 
sions and by putting more emphasis on policy- 
relevant research. 

Some of this research could focus on im- 
proving the productivity of older workers through 
better work environments and lifelong learning. 
Not everyone has the skills and interests to carry 
out particular tasks. What kinds of education and 
organizational arrangements are required to 
match the labor force with work needs? Many 
older workers may prefer part-time work. More 
studies are needed on how to organize 20- and 
30-hour work weeks so that they are profitable 
for organizations and satisfying for individuals. 


Redistribution of Work 


If part-time work becomes common for workers 
above 50 or 60, then more opportunities for 
part-time work may open up for younger peo- 
ple. As shown by the yellow line in Fig. 2, if 
people in their 60s and early 70s worked con- 
siderably more than today, then work effort 
could be evenly distributed at a level of about 
25 hours per week across ages 20 through 64. 
This level of effort could be achieved if a few 
percent were unemployed, a few percent worked 
40 hours per week, and the rest worked either 
20 or 30 hours per week. The ratio of non- 
workers to workers would be cut to a fraction 
of its current value. 

The 20th century was a century of re- 
distribution of income. The 21st century may be 


Table 1. Rostock indicators of demographic structure and change. R is the ratio of nonworkers to 
workers, with workers including everyone who works for remuneration for at least 1 hour per week 
(6). H is the number of hours worked per week per capita. The values for 2025 and the relative 
changes from 2005 to 2025 assume change in the population pyramid but no change in labor- 
force participation or effort by age and sex. For data sources, see (7). 


R, nonworkers per worker 


H, hours worked per week per capita 


Country 

2005 2025 Change 2005 2025 Change 
Germany 1.27 1.47 16% 16.28 14.95 —8% 
Denmark 0.97 1.12 15% 17.46 16.11 —8% 
France 1.43 1.69 18% 15.09 13.63 -10% 
Italy 1.59 1.86 17% 15.19 13.48 -11% 
Netherlands 1.01 1.20 19% 15.31 13.88 —9% 
UK 1.09 1.19 9% 17.32 16.34 -6% 
USA 1.09 0.99 —9% 18.71 18.29 -2% 
a century of redistribution of work. 30L 


Such redistribution would spread 
work more evenly across people 
and over the ages of life. Individ- 
uals could combine work, educa- 
tion, leisure, and child-rearing in 
varying amounts at different ages. 
This vision is starting to receive 
some attention from social scien- 
tists (16-21). Achieving it would 
require radical increases in oppor- 
tunities to work 20 or 30 hours per 
week. The Netherlands, Denmark, 
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and Norway may be harbingers of 10 


economies with many part-time 
jobs. Much more research, however, 
is needed on basic issues concerning 
the efficiency of such redistribution 
of work and whether individuals 
would prefer it. 

Future generations may think 
we (Europeans and Americans) 
were irrational about the way we 
spend the time of our lives. We 
concentrate work in those ages of life when we 
can have children and when children need the 
time and energy of their parents. Then, when we 
are in our late 50s or early 60s, we retire, 
enjoying decades of leisure, largely paid for by 
levies on younger adults who are also taking 
care of children. We concentrate the leisure of 
our lives in the years when we can no longer 
have children and when any children we did 
have no longer need the care they once required. 

A redistribution of work might make it 
easier for younger people to have the number 
of children they would like to have. The causes 
of low fertility in Europe, however, are complex 
and only partially understood (2, 22). Funding 
for research on policy options has been meager. 

How could parents support themselves and 
their children if they worked only 20 or 30 hours 
per week? If the need for transfer payments 
from workers to nonworkers were reduced, taxes 
and other levies could likewise be reduced. Fur- 
thermore, a greater fraction of women, at both 
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Fig. 2. Average hours worked per week by age in Germany 
(6, 7). The blue line graphs the pattern in 2005 that produced 
the overall level of 16.3 hours of work per week per capita. The 
red line shows the increase in work effort by older Germans 
required to maintain this overall level of effort in 2025. The 
yellow line illustrates one way to redistribute work more equally 
while maintaining the overall effort: People between 20 and 65 
would work 25.1 hours per week on average. 


younger and older ages, would be in the work 
force. In principle, it should be possible to re- 
distribute work while maintaining standards of 
living (/6). The specifics of how to do this, 
however, have to be worked out. 

Population aging is not going to stop in 2025. 
Long lives are the probable destiny of most people 
alive today in developed countries (/, 3-5). 
Extended life spans make life-course flexibility 
more desirable for individuals and societies 
(19, 20). Social scientists can develop knowl- 
edge about how to move from the stultifying 
regime in Germany, France, and most of the 
European Union to societies in which individuals 
have greater choice about how to spend the time 
of their lives. To have influence, social scientists 
will have to augment their fascination with new 
kinds of data collection and more sophisticated 
methods of statistical analysis with a deeper 
concern about making their research more 
directly relevant to policy issues. To supplement 
recent decades of micro analysis of individual 
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behavior, researchers need to perform more 
macro analysis at the population level and more 
analysis of micro-macro interactions. As dis- 
cussed by Butz and Torrey in this issue (23), 
such a reorientation could help social scientists 
contribute more effectively to understanding of 
demographic challenges and opportunities and a 
range of other important social issues. 
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PERSPECTIVE 


The Influence of a Sense of Time 
on Human Development 


Laura L. Carstensen 


The subjective sense of future time plays an essential role in human motivation. Gradually, 

time left becomes a better predictor than chronological age for a range of cognitive, emotional, 
and motivational variables. Socioemotional selectivity theory maintains that constraints on time 
horizons shift motivational priorities in such a way that the regulation of emotional states becomes 
more important than other types of goals. This motivational shift occurs with age but also appears 
in other contexts (for example, geographical relocations, illnesses, and war) that limit subjective 


future time. 


ost scientists would agree that the ex- 

plicit study of time falls in the pur- 

view of physics, yet interest in various 
aspects of time spans the natural and social 
sciences. Time is an integral part of virtually all 
psychological phenomena. From the sequenc- 
ing of rewards involved in operant and clas- 
sical conditioning to the flow of oxygen in 
the measurement of brain activation, time is 
built into most behavioral and psychological 
processes. Psychological science, however, 
has focused relatively little on the implications 
of our ability not only to monitor time but also 
to appreciate that time eventually runs out. I 
maintain that the subjective sense of remaining 
time has profound effects on basic human 
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processes, including motivation, cognition, and 
emotion. 

Although change over time is the basic foun- 
dation of developmental psychology, theoretical 
models of human development focus almost 
exclusively on the passage of time since birth. 
In child development, this marker has served 
scientists well. A substantial literature shows 
that chronological age is an excellent (albeit 
imperfect) predictor of cognitive abilities (/, 2), 
language (3), and sensorimotor coordination 
(4). At increasingly older ages, however, chron- 
ological age is a poorer predictor. Instead, in- 
creased heterogeneity or differentiation within 
samples is considered to be a cardinal feature of 
life-span development (5). Presumably, this is 
due primarily to differences in experiences and 
opportunities that individuals encounter over 
time. Chronic stress, level of education, close 


relationships, and social status all place individ- 
uals on very different developmental trajectories 
that affect not only day-to-day functioning 
but also health and longevity (6). Late in life, 
chronological age continues to provide a rough 
marker of accumulated life experience, but it 
loses the precision it holds in youth. 

A second index of time becomes salient as 
people grow older, namely the subjective sense 
of remaining time until death. Although cor- 
related with chronological age, this subjective 
sense of time gradually becomes more im- 
portant than time since birth. Because goal- 
directed behavior relies inherently on perceived 
future time, the perception of time is inextrica- 
bly linked to goal selection and goal pursuit. 
Socioemotional selectivity theory (SST), a life- 
span theory of motivation, is grounded funda- 
mentally in the human ability to monitor time, 
to adjust time horizons with increasing age, and 
to appreciate that time ultimately runs out (7). 
SST maintains that time horizons play a key 
role in motivation. Goals, preferences, and even 
cognitive processes, such as attention and mem- 
ory, change systematically as time horizons 
shrink. Because chronological age is correlated 
with time left in life, systematic associations 
between age and time horizons appear, but find- 
ings from experimental studies show that when 
time perspective is manipulated or controlled 
statistically, many age differences disappear. In 
short, across many dimensions, older and youn- 
ger people behave remarkably similarly when 
time horizons are equated. 

Events like the attacks on September 11th 
and the severe acute respiratory syndrome 
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(SARS) epidemic in Hong Kong completely 
eliminated age differences on some measures of 
motivation (8). Young men who suffered from 
HIV before effective treatments were available 
seemed to view their social world in the same 
way that very old people do (9). In all of these 
cases, the fragility of life was acutely primed. 
The subjective sense of time left was affected 
and, in turn, equated age differences in pref- 
erences and desires. 

SST maintains that two broad categories of 
goals shift in importance as a function of per- 
ceived time—those concerning the acquisition 
of knowledge and those concerning the regula- 
tion of emotion states. When time is perceived 
as open-ended, goals that become most highly 
prioritized are most likely to be those that are 
preparatory, focused on gathering information, 
on experiencing novelty, and on expanding 
breadth of knowledge. When time is perceived 
as constrained, the most salient goals will be those 
that can be realized in the short-term, sometimes 
in their very pursuit. Under such conditions, 
goals tend to emphasize feeling states, particu- 
larly regulating emotional states to optimize 
psychological well-being. SST predicts that peo- 
ple of different ages prioritize different types of 
goals. As people age and increasingly perceive 
time as finite, they attach less importance to 
goals that expand their horizons and greater 
importance to goals from which they derive 
emotional meaning. Obviously, younger people 
sometimes pursue goals related to meaning and 
older people pursue goals related to knowledge 
acquisition; the relative importance placed on them, 
however, changes. Indeed, differences between 
young and old are most striking when goals 
compete, such as situations in which expanding 
horizons also entail unpleasant emotional experi- 
ences. According to SST, in such cases younger 
people are far more likely than older people to 
pursue their goal despite the negative emotional 
burden. This theoretical shift has helped to 
make sense of a number of findings in the lit- 
erature previously referred to as the “paradox of 
aging” (10). Older people were observed to have 
smaller social networks, to be drawn less than 
younger people to novelty, and to reduce their 
spheres of interest; at the same time, however, 
they were as happy as (if not happier than) youn- 
ger people. This makes sense if motivational 
changes with age lead people to place priority on 
deepening existing relationships and developing 
expertise in already satisfying areas of life. 

However, according to SST, such differences 
are not due to “age” but to differences in the 
perception of future time. There are clear age 
differences in preferences, and these differences 
can be eliminated by selectively expanding or 
constraining time horizons (//, /2). For example, 
asked to choose among three social partners who 
represent different types of goals (13), the major- 
ity of older people reliably choose emotion- 


Capture those special moments. 


ARNOLO Camera 


Capture the unexplored world. 


ARNOLD Camera 


Fig. 1. An example of one pair of advertisements used to study age differences in preferences and 
memory for products. In each pair, the advertisements were identical except for the slogan. One slogan 
was related to gaining knowledge. The second promised an emotionally meaningful reward (24). 


ally close social partners. Yet when 
asked to make the choice after 
imagining that they just received a 
telephone call from their physician 
who told them about a new medical 
advance that virtually ensures they 
will live far longer than expected, 
older peoples’ choices resembled 
those of younger people (/2). Sim- 
ilarly, when younger people are 
asked to imagine that they will soon 
move to a new geographical loca- 
tion, they “look like” older people: 
they, too, now choose emotionally 
close social partners (//). Thus, end- 
ings need not be related to old age 
or impending death. They need 
simply to limit time horizons. Pref- 
erences long thought to reflect in- 
tractable effects of biological or 
psychological aging appear fluid 
and malleable. 

We began to explore the ways in which these 
different motivational states influence informa- 
tion processing. Helene Fung and I developed 
pairs of advertisements that were identical ex- 
cept for the featured slogan (/4). In one ver- 
sion of the advertisements, the slogans promised 
to expand horizons. In the other, the slogans 
promised more emotional rewards (Fig. 1). The 
majority of older participants preferred the ad- 
vertisements featuring the emotion-related slogans. 
They also remembered these slogans and the 
products associated with them better than they 
did the slogans about exploration and knowl- 
edge. When older participants were asked to 
imagine an expanded future before they indicated 
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Fig. 2. The percentage of signal change in amygdala activation 
in response to emotionally positive, emotionally neutral, and 
emotionally negative images. Younger people show significantly 
increased activation in response to positive and negative 
images. Older people show increased activation only in response 
to positive images (20). [Adapted from Mather ef al. (2004)] 


their preference, they made choices similar to those 
made by younger participants, that is, they failed to 
show a significant preference for the emotion- 
related slogans. 

Recently, research has indicated a special 
preference for emotionally positive information 
over emotionally negative information in mem- 
ory in older adults (/5—/7). This is particularly 
intriguing because it has long been known that 
younger people find negative information more 
attention-grabbing and memorable than positive 
information. Indeed, many have posited an 
evolutionary basis to a preference in memory 
and attention for negative information. Nega- 
tive material is richer in information than is 
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positive material, which often soothes instead of 
arouses. If the value placed on learning new 
information changes with shrinking time hori- 
zons, however, this preference should dissipate 
across adulthood. Our research team has coined 
the term the “positivity effect” to describe a de- 
velopmental pattern that has emerged in which a 
selective focus on negative stimuli in youth shifts 
to a relatively stronger focus on positive in- 
formation in old age (/6). Although in some 
studies, the effect is accounted for primarily by 
younger people remembering relatively more 
negative material than positive material, and in 
other studies the effect is accounted for by older 
people remembering more positive than negative 
material, a shift in the ratio of positive to negative 
across age groups has nevertheless emerged as a 
reliable finding in the research literature (18, 19). 
Of particular interest is recent evidence that 
older people process negative information less 
deeply than they do positive information (20). 
While in a brain scanner, older and younger 
people viewed images of positive, negative, and 
neutral stimuli. Using event-related functional 
magnetic resonance imaging, activation in the 
amygdala was measured in response to the dif- 
ferent types of images. Consistent with the re- 
sults of the behavioral studies noted above, 
whereas younger adults showed heightened 
amygdala activation in response to both posi- 
tive and negative images compared with neutral 
images, amygdala activation in the older adults 
increased only in response to the positive im- 
ages (Fig. 2). Thus, not only at recall but at 
very early stages of processing, older adults 
diminish encoding of negative material. 


SST suggests that many differences between 
younger and older people that have long been 
believed to reflect intractable age differences in 
attitudes or the consequences of age-related de- 
cline may be neither. Young or old, when people 
perceive time as finite, they attach greater im- 
portance to finding emotional meaning and sat- 
isfaction from life and invest fewer resources 
into gathering information and expanding hori- 
zons. Tests of hypotheses derived from SST 
have shed light on the literature showing that, 
although social networks grow smaller, they also 
grow more satisfying. Older people appear to 
prefer such social networks. Hypotheses gen- 
erated by SST have led to discoveries of 
differential decline in the processing of certain 
types of information, suggesting that motiva- 
tion contributes to at least some observed age 
differences. As illustrated in the study of adver- 
tisement preferences described above, under- 
standing these shifts in motivation can help us 
to frame information for older adults such that 
it is more memorable. It also may be that 
special reliance on emotional responses to op- 
tions will aid decision-making. Of course, a 
focus on emotionally satisfying stimuli may be a 
double-edged sword. Preferential attention to 
positive information, for example, may con- 
tribute to susceptibility to scams or other un- 
scrupulous efforts to take advantage of older 
people. Many questions remain. It appears, how- 
ever, that consideration of time horizons can 
offer insights into the ways in which younger 
and older people differ, but also show that 
behavioral differences are often driven by the 
same underlying mechanisms. 
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